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included. 

A FIRST preliminary amendment 



A SECOND or SUBSEQUENT preliminary amendment 
A substitute specification. 

A change of power of attorney and/or address letter. 

A computer-readable form of the sequence listing in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821-1.825. 
A second copy of the published international application under 35 U.S.C. 154(d)(4). 
A second copy of the English language translation of the international application under 35 U.S.C, 154(d)(4)., 
_ Other items or information: PCT/RO/101 (4 pp.); PCT/IPEA/401 (3 pp.); PCT/lPEA/409 (5 pp.); PCT/ISA/210 (9 pp.); 
Copy of WO 00/55330 published September 21, 2000 w/PCT/ISA/210: Specification (27 pp.), Claims 1-40 (3 pp.), 4 sheets of 
Drawings, Abstract 



FORM PTO 1390 (REV 11-2000) page 1 of 2 
Doc 384.406 



JC16 Rec'd PCT/PTO SEP 1 8 2001 



IMTERNATIOKAL APPIKATIONNO. 

PCT/GBOQ/01035 



ATTORNEY'S DOCKET NO. 

05407.00003 



17. H The following fees are submitted: 

Basic National Fee (37 CFR 1.492(a)(1)-<5): 

Neither international preliminary examination fee (37 CFR 1.482) 

nor international search fee (37 CFR 1-445(a)(2) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $1,000.00 

International preliminary examination fee (37 CFR 1.482) not patdto 

USPTO but Internationa! Search Report prepared by the EPO or JPO. $ 860.00 

International prefiminary examination fee (37 CFR 1.482) not paid to USPTO but 

international search fee (37 CFR 1455(a)(2)) paid to USPTO $ 710.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO) 

but all claims did not satisfy provisions of PCT Artkjle 33{1)-(4) $ 690.00 

International preliminary examination fee (37 CFR 1.482) pakJ to USPTO 

and all claims satisfied provisions erf PCT Article 33(1)-(4) ... - $ 100.00 

ENTER APPROPRiATE BASIC FEE AMOUNT 



CALCULATIONS I PTO USE ONLY 



$860.00 



Surcharge of $130.00 for furnishing the oath or declaration later than □ 20 □ 30 months from the 
claimed priority date (37 CFR 1.492(e). 



CLAIMS 



NUMBER FILED 



NUMBER EXTRA 



RATE 



Total Claims 



43 



-20 = 



23 



X $18.00 



$414.00 



Independent Oaims 



1 



.3 = 



X$ 80.00 



Multiple dependent claims (if applicable) 



X $270.00 



TOTAL OF ABOVE CALCULATIONS 



$1.274.00 



□Applicant claims small entity status. See 37 CFR 1,27. The fees indicated below above are reduced 
by 1/2. 



SUBTOTAL = 



$1.274.00 



Processing fee of $130.00 for furnishing the English translation later than □ 20 □ 30 months from the 
earliest claimed priority date (37 CFR 1.492(f). 



TOTAL NATIONAL FEE^ 



$1.274.00 



Fee for recording the enclosed assignment (37 CFR 1.21(h). The assignment must be accompanied by an 
appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property. 



$ 40.00 



TOTAL FEES ENCLOSED = 



$1,314.00 



Amount to be: 

refunded | $ 



charged | $ 



A check in the amount of $ to cover the above fees is enclosed. 

Please charge my Deposit Account No. 19-0733 In the amount of $1.314.00 to coverthe above fees. A duplicate 
copy of this sheet is enclosed. 

The Commissioner is hereby authorized to char^ any additional fees which may be required, or credit any 
overpayment to Deposit Accoiffit No. 19-0733. A duplicate copy of this sheet is enclosed. 
Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 
Information should not be Included on this form. Provide credit card infomnation and authorization on PTO-2038. 
NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a petition to revive (37 CFR 
1 . 1 37(a) or (b)) must be filed and granted to restore the application tojpending status. 



□ 
lEI 

□ 



SEND ALL CORRESPONDENCE TO: 

Banner &Witcoff, Ltd 
Eleventh Floor 
1001 G Street. N.W. 
Washington, D.C, 20001-4597 
Telephone (202) 508-9100 
Date: September 18. 2001 



IGNATURE Q 



SIGNATURE 
Lisa M. Hemmendinaer 



NAME 



42.653 



REGISTRATION NUMBER 



FORM PTO 1390 (REV 1 1-2000) page 2 of 2 
Doc 384,406 



09/936845 

JC16Rai'dPCT/PT0 SEP 1 8 2001 



Attorney Docket No. 05407.00003 
Liteniational Application No, PCT/GBOO/01035 

PATENT APPLICATION 

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

In re application of BOX PCX 

National Phase Application 
Johnathan A. NAPIER . PCT/GBOO/01035 

FUed: March 20, 2000 

Serial No.: Unassigned Group Art Unit: Unassigned 

Filed: CONCURRENTLY HEREWITH Examiner: Unassigned 

For: POLYS ATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDmS 
ELEGANS 

LETTER PURSUANT TO 37 CFR 1.821(f) 

Assistant Honorable Commissioner of 

Patents and Trademarks 
Washington, D.C. 20231 

Sir- 
In the matter of the above-identified application, which is filed concurrently herewith, 
Applicants submit a computer diskette containing the sequences of the instant application. It is 
hereby certified that the paper and computer copies of these sequences are identical in content. 

Respectfiilly submitted, 



Lisa M. Hemmendinger 
Reg. No. 42,653 




September 18, 2001 
BANNER & WITCOFF, LTD. 
Eleventh Floor 
1001 G Street, N.W. 
Washington, D.C. 20001-4597 
(202) 508-9100 



Pfmcnif^^- 06 MAR 2002 



PATENT 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
In re Application of 
NAPIER 

Serial No. 09/936,845 
Filed: September 28, 2001 



Group Art Unit: TBA 
Examiner: TBA 



Atty.Dkt.No. 05407.00003 

For: POLYUNSATURATED FATTY ACID (PUFA) ELONGASE FROM 
CAENORHABDITIS ELEGANS (AS AMENDED) 



AMENDMENT 

Assistant Director for Patents 
Washington, D.C. 20231 

Sir: 

This amendment is filed in response to the Notification of Missing Requirements mailed 
January 8, 2002, Accompanying this amendment are: 

• a paper copy of a substitute sequence listing; 

• a computer readable form of the substitute sequence listing; and 

• a copy of the Notification of Missing Requirements. 

We believe no fee is due in coimection with this amendment. If a fee is due, please charge our 
Deposit Account No. 19-0733. 

Please enter the following amendment. 



IN THE SPECIFICATION 

(1) Delete the sequence listing insert the paper copy of the substitute sequence Hsting at the 

1 



end of the application. 



Remarks 



Paper and computer readable forms of a substitute sequence listing accompany this paper. 
The contents of the substitute sequence listing are identical to the sequence listing originally filed 
except for the identification of amino acid "variants" in SEQ ID NO:22. The substitute sequence 
listing adds no new matter to the application. 

I believe the contents of the computer readable form and the paper copy of the substitute 
sequence listing are identical. 



Respectfully submitted. 





Lisa M. Hemmendinger 
Registration No. 42,653 




Banner & Witcoff, Ltd. 
1001 G Street, N.W., Eleventh Floor 
Washington, D.C. 20001-4597 
(202) 508-9100 
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M THE UMTED STATES PATENT AND TRADEMARK OFFICE 
In re Application of 



NAPIER 

Serial No. TBA 

Filed: September 28, 2001 



Group Art Unit: 
Examiner: 

Atty.Dkt.No. 05407.00003 



For: POLYUNSATURATED FATTY ACID (PUFA) ELONGASE FROM 
CAENORHABDITIS ELEGANS (AS AMENDED) 



PRELIMINARY AMENDMENT 

Assistant Commissioner of Patents 
Washington, D.C. 20231 

Sir: 

Please enter the following amendments before examining the application referenced 
above. We believe no fee is due in connection with this filing. If a fee is due, please charge 
Deposit Account No. 19-0733. 

Appendix 1 is a copy of the amended paragraphs, with markings to show changes made. 
Appendix 2 is a copy of the amended claims, with markings to show changes made. 
IN THE SPECIFICATION 

(1) On page 1, delete the title and substitute therefore: 

POLYUNSATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDITIS 
ELEGANS 

(2) On page 1, after the title, insert the following paragraph: 

This application claims the benefit of and incorporates by reference the following applications: 
PCT/GBOO/01035 filed March 20, 2000, 9906307.5 filed March 18, 1999, and 0003869.5 filed 



February 19, 2000, all of which were published in English. 

(3) On page 3, delete the first foil paragraph and substitute the following paragraph: 

In order to identify genes encoding PUFA elongases, it is necessary to study systems in which 
the synthesis of PUP As is well documented; a good example of this is the model animal system 
C elegcms, a small free-living worm (Tanaka et al, (1996), Lipids 31, 1173-78). C elegcms, like 
most other animals, and in contrast to higher plants, synthesizes PUFAs such as arachidonic acid 
(AA; 20:4 A^' ^' as precursors to a class of molecules known as the eicosanoids, which in 
turn serve as precursors for compounds such as prostaglandins and leucotriens (Horrobin, (1990), 
Review in Contemp Pharmacotherapy, 1:1-45). The presence of AA and other long chain 
polyunsaturated fatty acids in C elegans is well documented (Tanaka et aL, (1996), Lipids 1, 
1173-1178). The complete sequence of the nematode's genome is now publicly available (The 
C elegans cosortium, 1998, Science 282, 2012-2018, See the database at the website identified 
with the URL file type, www host server, domain name sanger.ac.uk and foUov^ng the path 
from "Projects" to "C_elegans" to "blast^server.shtml". 

(4) On page 7, delete the paragraph beginning on line 18 and extending to page 8, line 21 and 
replace it with the following paragraph: 

Initially the C elegans databases were searched for any sequences which showed low levels of 
homology to yeast ELO genes {EL02 and EL03) using the TBLASTN programme, A similar 
search was carried out using short (20 to 50 amino acid) stretches of ELO genes which were 
conserved amongst the three ELO polypeptide sequences. C elegans sequences which were 
identified by this method were then used themselves as search probes, to identify any related C. 
elegans genes which the initial search with the yeast sequences failed to identify. This was 
necessary because the level of homology between the yeast ELO genes and any worm genes is 
always low (see BLAST scores later). To allow for a more sensitive search of worm sequences, 
a novel approach was adopted to circumvent the major drawback with searches using the BLAST 
programmes, namely that the search string (i.e. the input search motif) must be longer than 15 
characters for the algorithm to work. Thus, if it was desired to search for a short motif (like a 



histidine box), then the BLAST programme would not be capable of doing this. A complete list 
of all the predicted ORFs present in the C elegans genome exists as a database called Wormpep, 
which is freely available from the Sanger WWW site identified with the URL address http file 
type, www host server, domain name sanger.ac.uk and following the path from "Projects" to 
"C_elegans" to "webace^front^end.shtml". The latest version of Wormpep was down loaded to 
the hard disc of a Pentium PC, and re-formatted as a Microsoft Word6 document, resulting in a 
document of about 3,500 pages. This was then searched using the "Search & Replace" frmction 
of Word6, which also allows for the introduction of "wildcard" characters into the search motif 
So, for example, it is possible to search both for the short text string HPGG, which would 
identify any predicted worm ORF present in the Wormpep 3,500 page document containing this 
motif^ or alternatively search with HPGX (where X is a wild card character). Clearly, such 
(manual) searches of a 3,500 page document are extremely time-consuming and demanding, also 
requiring visual inspection of each and every identified ORF, For example, searching with a 
motif such as HXXHH identifies in excess of 300 different ORFs. However, by using a number 
of different short search strings (as outlined below), and combining these with other methods for 
identifying putative elongase enzymes, a number of candidate ORFs have been identified. 

(5) On page 8, delete the paragraph beginning at line 23 to and extending to page 9, line 3 
and substitute the following paragraph: 

As a negative control, to demonstrate that the FAEl gene sequence was unlikely to provide a 
useful search sequence in the identification of C elegans sequences encoding for PUFA 
elongases, the GenBank databases identified with the URL address http file type, www host 
server, domain name "ncbi.nlm.nih.gov" and following the path fi*om Web to Search to 
index.html were searched using the Arabidopsis FAEl polypeptide sequence ot identify related 
genes or expressed sequence transcripts (ESTs). GenBank is the NIH genetic sequence database, 
an annotated collection of all publicly available DNA sequences (Nucleic Acid Research (1998) 
26, 1-7). There are approximately 2,162,000,000 bases in 3,044,000 sequence records as of 
December 1998. The search was carried out using the BLAST2 (Basic Local Alignment Search 
Tool) algorithm (Altschul et al, (1990) JMolBiolllS, 403, 410). Although a number of plant 



ORFs and ESTs were reported as being related, o animal sequences were identified by this 
search, confirming the observation that FAEl was unlikely to be a suitable candidate as a search 
template for PUFA elongases, 

(6) On page 9, delete the paragraph beginning on line 5 and substitute the following paragraph: 

Using the three yeast fatty acid elongase sequences (ELO 1, 2, 3) as probes, a number of putative 
ORFs in the DNA of C. eJegans-drnved cosmid sequences which form the C elegans genomic 
sequence database was identified. Moreover, an extensive and time-consuming search of a 
downloaded copy of the WormPep database identified with the URL address ftp file type, ftp 
host server, domain name sanger.ac.uk, following the path from "pub" to "databases" to 
"wormpep" using manual search strings inMSWord 6, identified a number of C. elegans ORFs 
which contained presumptive histidine boxes. Wormpep contains predicted proteins from the 
Caenorhabditis elgans genome sequence project, which is carried out jointly by the Sanger 
Centre in Cambridge, UK and Genome Sequencing Center in St. Louis, USA. The current 
Wormpep database, Wormpep 16, contains 16,332 protein sequences (7,120,115 residues). 
Search strings used included [HXXHH], [HXXXHH], [QXXEIH] and [YHH], Comparison of 
the data from the two different searches indicated a small (<10) number of putative ORFs as 
candidate elongases. The histidine box motifs are located at amino acids 162-166 of SEQ ID 
N0:15, amino acids 186-190 of SEQIDN0:16, amino acids 145-150 of SEQIDN0:17, amino 
acids 147-151 of SEQ ID N0:18, amino acids 141-145 of SEQ IDN0:19, amino acids 177-181 
of SEQ IDNO;20, amino acids 155-159 of SEQIDN0:21, and amino acids 233-237 of SEQ ID 
NO:22. 

(7) On page 10, delete the paragraph beginning on line 7 and substitute the following 
paragraph: 

Since the inventors had previously observed that C elegans genes involved in the synthesis of 
PUFA may exist in tandem (for example the A5 and A6 desaturases required for AA and GLA 
synthesis, respectively, are <1 kfi apart on chromosome IV (Michaelson et al, (1998), FEBS 
Letts 439, 215-218), the positions of the putative C. elegans elongase ORFs were determined 
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using the Sanger Centre's WebAce C elegans server identified with the URL address http 
filetype, www host server, domain name sanger.ac.uk and following the path fi-om "Projects" to 
C_elegans" to "webac_front_ends.shtmI". This indicated that two pairs of putative elongases 
were in close proximity to each other on the C. elegans chromosome IV. 

(8) On page 13, delete the paragraph beginning at line 2 and substitute the follov^ng 
paragraph: 

Putative elongase sequences F56H1 1.4 and F41H10.8 were cloned by PCR into the pYES2 
vector (Invitrogen). A C elegans mixed stage cDNA library was used as a PCR template. 
F56H1 L4 was amplified using primers: 

56hl 14.for 5'-GCGGGTACCATGGCTCAGCATCCGCTC-3* (SEQ ID NO: 1) and; 
56hll4.rev 5'-GC GGGATCCT TAGTTGTTCTTCTTCTT-3' (SEQ ID N0:2). 
F41H10,8 was amplified using primers: 

41hl08.for 5'-GC GGGTACCA TGCCACAGGGAGAAGTC-3' (SEQ IDN0:3) and; 
416hl08,rev 5*-GCGGGATCCTTATTCAATTTTTCTTTT-3* (SEQ ID NO:4). 

(9) On page 13, delete the paragraph beginning on line 13 and substitute the following 
paragraph: 

An ORF encoding thcMortierella alpina A^-fatty acid desaturase (Michaelson, L.V., et al. 

(1998) 1 Biol Chem., 273, 19055-19059) was amplified using primers: 

MadS.for 5'-GC GAATTCA CCATGGGTACGGACCAAGGA-3* (SEQ ID N0:5) and; 

MadS.rev 5'-GC GGAGCTC CTACTCTTCCTTGGGACG-3' (SEQ ID N0:6). 



(10) Delete the informal sequence listing at pages 22-27 and insert the paper copy of the 
formal sequence listing at the end of the application. 



IN THE CLAIMS 

1. (Amended) An isolated polypeptide comprising a functional long chain 
polyunsaturated fatty acid (PUFA) elongase having a function of extending a chain length of an 
18 carbon PUFA to 20 carbons in length. 

3. (Amended) A polypeptide according to claim 1 wherein the polypeptide comprises a 
portion of the amino acid sequence shown in SEQ ID NO: 15 or a variant thereof 

7. (Amended) A polypeptide according to claim 1 wherein the polypeptide sequence 
includes a sequence motif responsible for Endoplasmic Reticulum (ER)-retention. 

8. (Amended) A polypeptide according to claim 1 wherein the polypeptide is capable of 
elongating palmitoleic acid (PA; 16:1A^) to vacceric acid(VA;18:lA^^). 

9. (Amended) A polypeptide according to claim 1 wherein the polypeptide is an animal 
polypeptide. 

16. (Amended) An isolated DNA molecule encoding a polypeptide according to claim 1 . 

17. (Amended) A DNA molecule according to claim 16 wherein the DNA molecule 
comprises the sequence shown in SEQ ID N0:7 or variants of that sequence due to base 
substitutions, deletions, and/or additions, 

18. (Amended) An engineered organism engineered to express a polypeptide according to 
claim 1. 

21 . (Amended) An engineered organism containing a synthetic pathway for the production 
of a polypeptide according to claim L 

23. (Amended) An engineered organism according to claim 21 wherein the pathway 
includes A^-fatty acid desaturase. 
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24. (Amended) An engineered organism according to claim 21 wherein the organism is a 
lower eukaryote. 

27. (Amended) A transgenic plant engineered to express a polypeptide according to 
claim 1. 

28. (Amended) A transgenic plant containing a DNA molecule according to claim 16. 

29. (Amended) A method of producing a PUFA comprising carrying out an elongase 
reaction catalysed by a polypeptide according to claim L 

32. (Amended) A PUFA produced by a method according to claim 29. 

35. (Amended) A pharmaceutical composition comprising a polypeptide according to 

claim 1. 

37. (Amended) A pharmaceutical composition according to claim 35 wherein the 
composition comprises a pharmaceutically-acceptable diluent, carrier, excipient or extender. 

38. (Amended) A method of elevating the PUFA levels of an animal or a plant comprising 
the step of supplying to the animal or plant a polypeptide according to claim L 

39. (Amended) A method according to claim 38 wherein the animal is a mammal. 

40. (Amended) A method according to claim 39 wherein the mammal is a human. 

41. (New) A pharmaceutical composition according to claim 36 wherein the 
composition comprises a pharmaceutically-acceptable diluent, carrier, excipient, or extender. 

42. (New) A method of elevating the PUFA levels of an animal or a plant comprising 
the step of supplying the animal or plant with a DNA molecule according to claim 16. 

43. (New) A method of elevating the PUFA levels of an animal or a plant by 
supplying the animal or plant with a PUFA according to claim 32. 
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Remarks 



Amendments to the Claims and Specification 

The claims are amended to remove multiple dependencies. New claim 41-43 is 
supported by original claim 37. New claims 42 and 43 are supported by original claim 38. 

The specification is amended to delete hyperlinks and to insert sequence identifiers. 
The amendments add no new matter to the specification. 

Formal Sequence Listing 

A paper and a computer readable form of a formal sequence Usting accompany this 
amendment. I believe the sequence contents of the paper and computer readable forms are 
identical. 

The formal sequence listing adds no new matter to the specification. It contains the 
primer sequences disclosed at page 13 as SEQ ID N0S:l-6. SEQ ID NOS:7-22 are the 
sequences present in the informal sequence listing as SEQ ID NOS: 1-16, respectively. 



Respectfully submitted. 



Date: September 18, 2001 




Lisa M. Hemmendinger 
Registration No. 42,653 



Banner & WitcofF, Ltd. 
1001 G Street, N,W., Eleventh Floor 
Washington, D.C. 20001-4597 
(202) 508-9100 
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Appendix 1 , Version of Amended Paragraphs with Markings to Show Changes Made 

On page 1, in the title: 

[POLYSATURATED] POLYUNSATURATED FATTY ACID (PUFA) 
ELONGASE FROM C AENORHABDITIS ELEGANS 

Page 3, first fixU paragraph: 

In order to identify genes encoding PUFA elongases, it is necessary to study 
systems in which the synthesis of PUFAs is well documented; a good example of 
this is the model animal system C elegans, a small free-living worm (Tanaka et 
al, (1996), Lipids 31, 1173-78). C. elegans, like most other animals, and in 
contrast to higher plants, synthesizes PUFAs such as arachidonic acid (AA; 20:4 
^5, 8, 11, i4>^ precursors to a class of molecules known as the eicosanoids, which 
in turn serve as precursors for compounds such as prostaglandins and leucotriens 
(Horrobin, (1990), Review in Contemp [Pharmacotherpy] Pharmacotherapy. 1:1- 
45), The presence of AA and other long chain polyunsaturated fatty acids in C. 
elegans is well documented (Tanaka et al, (1996), Lipids 1, 1173-1178). The 
complete sequence of the nematode's genome is now publicly available (The C 
elegans cosortium, 1998, Science 282, 2012-2018 , See the database at the 
website identified with the URL file type, www host server, domain name 
sanger.ac.uk and following the path from "Projects" to "C elegans" to 
"blastserver. shtml" {Database at 

http://www.sangeKac. uk/Projects/Celegans/blastserver,shtml)] . 

Page 7, line 18 to page 8, line 21: 

Initially the C. elegans databases were searched for any sequences which showed 
low levels of homology to yeast ELO genes (EL02 and EL03) using the 
TBLASTN programme. A similar search was carried out using short (20 to 50 
amino acid) stretches of ELO genes which were conserved amongst the three 
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ELO polypeptide sequences. C elegans sequences which were identified by this 
method were then used themselves as search probes, to identify any related C. 
elegans genes which the initial search with the yeast sequences failed to identify. 
This was necessary because the level of homology between the yeast ELO genes 
and any worm genes is always low (see BLAST scores later). To allow for a 
more sensitive search of worm sequences, a novel approach was adopted to 
circumvent the major drawback with searches using the BLAST programmes, 
namely that the search string (i.e. the input search motiQ must be longer than 15 
characters for the algorithm to work. Thus, if it was desired to search for a short 
motif (like a histidine box), then the BLAST programme would not be capable of 
doing this. A complete list of all the predicted ORFs present in the C elegans 
genome exists as a database called Wormpep, which is freely available from the 
Sanger WWW site identified with the URL address http file type, www host 
server, domain name sanger.ac.uk and following the path fi"om ''Projects'' to 
"C_elegans" to "webace_front_end.shtml" 

[(http://www.sanger,ac.uk/Projects/C_elegans/webace__fi'ont_end.shtml)]. 
The latest version of Wormpep was down loaded to the hard disc of a Pentium 
PC, and re-formatted as a Microsoft Word6 document, resulting in a document of 
about 3,500 pages. This was then searched using the "Search & Replace" 
function of Word6, which also allows for the introduction of "wildcard" 
characters into the search motif So, for example, it is possible to search both for 
the short text string HPGG, which would identify any predicted worm ORF 
present in the Wormpep 3,500 page document containing this motif, or 
alternatively search with HPGX (where X is a wild card character). Clearly, such 
(manual) searches of a 3,500 page document are extremely time-consuming and 
demanding, also requiring visual inspection of each and every identified ORF. 
For example, searching with a motif such as HXXHH identifies in excess of 300 
different ORFs. However, by using a number of different short search strings (as 
outlined below), and combining these Avith other methods for identifying putative 
elongase enzymes, a number of candidate ORFs have been identified. 
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Page 8, line 23 to page 9, line 3: 

As a negative control, to demonstrate that the FAEl gene sequence was unlikely 
to provide a usefiil search sequence in the identification of C. elegans sequences 
encoding for PUFA elongases, the GenBank databases 

[(http://www.ncbi.nlni.nih,govAVeb/Search/index,htnil)] identified with the URL 
address http file type, www host server, domain name "ncbi.nlm.nih.goV and 
following the path fi-om Web to Search to index.html were searched using the 
Arabidopsis FAEl polypeptide sequence ot identify related genes or expressed 
sequence transcripts (ESTs). GenBank is the MH genetic sequence database, an 
annotated collection of all publicly available DNA sequences (Nucleic Acid 
Research (1998) 26, 1-7). There are approximately 2,162,000,000 bases in 
3,044,000 sequence records as of December 1998. The search was carried out 
using the BLAST2 (Basic Local Alignment Search Tool) algorithm (Altschul et 
al, (1990) JMolBiol 215, 403, 410). Although a number of plant ORFs and 
ESTs were reported as being related, o animal sequences were identified by this 
search, confirming the observation that FAEl was unlikely to be a suitable 
candidate as a search template for PUFA elongases. 

Page 9, line 5: 

Using the three yeast fatty acid elongase sequences (ELO 1, 2, 3) as probes, a 
number of putative ORFs in the DNA of C elegans-dQtiv^d cosmid sequences 
which form the C elegans genomic sequence database was identified. Moreover, 
an extensive and time-consuming search of a doAvnloaded copy of the WormPep 
database identified with the URL address ftp file type, ftp host server, domain 
name Sanger, ac.ulc following the path from "pub" to '"databases" to "wormpep" 
[(ftp://ftp.sanger,ac.uk/pub/databases/wormpep)] 

using manual search strings in MSWord 6, identified a number of C elegans 
ORFs which contained presumptive histidine boxes. Wormpep contains predicted 
proteins fi"om the Caenorhabditis elgans genome sequence project, which is 
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carried out jointly by the Sanger Centre in Cambridge, UK and Genome 
Sequencing Center in St. Louis, USA. The current Wormpep database, Wormpep 
16, contains 16,332 protein sequences (7, 120, 11 5 residues). Search strings used 
included [HXXHH], [HXXXHH], [QXXHH] and [YHH]. Comparison of the 
data from the two different searches indicated a small (<10) number of putative 
ORFs as candidate elongases. The histidine box motifs are [shown in bold in 
SEQ ID 9 to 16] located at amino acids 162-166 of SEP ID N0:15. amino acids 
186-190 of SEP ID NO: 16, amino acids 145-150 of SEP ID N0:17. amino acids 
147-151 of SEPIDN0:18. amino acids 14M45 of SEPIDNP:19. amino acids 
177-181 of SEP IDNO:20. amino acids 155-159 of SEP IDN0:2L and amino 
acids 233-237 of SEP ID NP:22 , 

Page 10, line 7: 

Since the inventors had previously observed that C elegans genes involved in the 
synthesis of PUFA may exist in tandem (for example the A5 and A6 desaturases 
required for AA and GLA synthesis, respectively, are <1 kB apart on 
chromosome IV (Michaelson et al., (1998), FEBS Letts 439, 215-218), the 
positions of the putative C elegans elongase PRFs were determined using the 
Sanger Centre's WebAce C elegans server identified with the URL address http 
filetype. www host server, domain name sanger.ac.uk and following the path from 
"Projects"' to C elegans" to ^'webac front ends.shtmF 
[(http://www. Sanger. ac.uk/Projects/C_elegans/webac_front_ends. shtml).] . This 
indicated that two pairs of putative elongases were in close proximity to each 
other on the C. elegans chromosome IV. 

Page 13, line 2: 

Putative elongase sequences F56H1 L4 and F41H10.8 were cloned by PCR into 
the pYES2 vector (Invitrogen). A C elegans mixed stage cDNA library was used 
as a PCR template. F56H1 1 .4 was amplified using primers: 
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56hl 14.for 5'-GC GGGTACCA TGGCTCAGCATCCGCTC-3' (SEP ID NO: n and; 
56hl 14.rev 5'-GCGSSATCCTTAGTTGTTCTTCTTCTT-3' rSE0IDN0:2Y 
F41H10.8 was amplified using primers: 

41hl08.for S'-GCG GGTACC A TGCC ACAGGGAGAAGTC-3 ' rSEOIDNO:3;) and; 
416hl08.rev 5'-GC GGGATCC TTATTCAATTTTTCTTTT-3' (;SE0IDN0:4Y 

Page 13, line 13: 

An ORE encoding the Mortierella alpina A^-fatty acid desaturase (Michaelson, 
L. v., et al. (1 998) J. Biol Chem., 273, 19055-19059) was amplified using 
primers: 

MadS.for 5'-GCGAATI£ACCATGGGTACGGACCAAGGA-3' (SE0IDN0:5^ and; 
Mad5.rev 5'-GC GGAGCTC CTACTCTTCCTTGGGACG-3' (SE0IDN0:6) . 
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Appendix 2. Version of Amended Claims with Markings to Show Changes Made 

1. (Amended) An isolated polypeptide comprising a functional long chain 
polyunsaturated fatty acid (PUFA) elongase [as herein defined] having a ftinction of extending a 
chain length of an 18 carbon PUFA to 20 carbons in length . 

3. (Amended) A polypeptide according to claim 1 [or claim 2] wherein the polypeptide 
[has at least] comprises a portion of the amino acid sequence shown in SEQ ID NO: 15 or a 
variant [variants] thereof 

7. (Amended) A polypeptide according to [any preceding] claim i wherein the 
polypeptide sequence includes a sequence motif responsible for Endoplasmic Reticulum (ER)- 
retention. 

8. (Amended) A polypeptide according to [any preceding] claim 1 wherein the 
polypeptide is capable of elongating palmitoleic acid (PA; 16: lA^) to vacceric acid (VA; 
18: lA^^). 

9. (Amended) A polypeptide according to [any preceding] claim 1 wherein the 
polypeptide is [from] an animal polypeptide . 

16. (Amended) [A] An isolated DNA [sequence] molecule encoding a polypeptide 
according to [any preceding] claim 1. 

17. (Amended) [A] A DNA [sequence] molecule according to claim 16 wherein the DNA 
molecule comprises the sequence shown in SEQ ID N0:7 or variants of that sequence due to 
base substitutions, deletions, and/or additions. 

18. (Amended) An engineered organism engineered to express a polypeptide according to 
[any one of claims] claim 1 . 
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21 . (Amended) An engineered organism containing a synthetic pathway for the production 
of a polypeptide according to [any one of claims] claim L 

23. (Amended) An engineered organism according to claim 21 [or 22] wherein the 
pathway includes A^-fatty acid desaturase. 

24. (Amended) An engineered organism according to [any one of claims] claim 21 [to 23] 
wherein the [animal] organism is a lower eukaryote. 

27. (Amended) A transgenic plant engineered to express a polypeptide according to [any 
one of claims] claim 1 [to 15]. 

28. (Amended) A transgenic plant containing a DNA [sequence] molecule according to 
claim 16 [or 17]. 

29. (Amended) A method of producing a PUFA comprising carrying out an elongase 
reaction catalysed by a polypeptide according to [any one of claims] claim 1 [to 15]. 

32. (Amended) A PUFA produced by a method according to [any one of claims] claim 29 
[to 31]. 

35 . (Amended) A pharmaceutical composition comprising a polypeptide according to [any 
one of claims] claim 1 [to 15]. 

37. (Amended) A pharmaceutical composition according to claim 35 [or claim 36] 
wherein the composition comprises a pharmaceutically-acceptable diluent, carrier, excipient or 
extender. 

3 8 . (Amended) A method of elevating the PUFA levels of an animal or a plant comprising 
the step of [by] supplying to the animal or plant a polypeptide according to [any of claims] claim 
1 [to 15, a DNA sequence according to claim 16 or 17, a foodstuff according to claim 33, a 
dietary supplement according to claim 34, a pharmaceutical composition according to any of 

15 
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claims 35 to 37 or a PUFA according to claim 32]. 

39. (Amended) A method [of treatment] according to claim 38 wherein the animal is a 
mammal. 

40, (Amended) A method [of treatment] according to claim 39 wherein the mammal is a 
human. 
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POLYSATURATED FATTY ACID (PUFA) ELONGASE FROM CAENORHABDITIS ELEGANS 

The present invention relates to polyunsaturated fatty acid (PUFA) elongases. More 
specifically, the invention relates to a DNA sequence from C elegans encoding a PUFA 
elongase. 



Unsaturated fatty acids are essential components required for normal cellular function, being 
involved in a diverse number of roles ranging from membrane fluidity to acting as signal 
molecules (Gill, L, Valivety, R. (1997). Trends Biotechnol 15, 401-409; Broun, P., et al 
(1999) Am. Rev. Nutr, 19, 197-216). In particular, the class of fatty acids known as the 
polyunsaturated fatty acids (PUFAs) has attracted considerable interest as pharmaceutical 
and nutraceutical compounds (Broun supra; Horrobin, D. R (1990) Reviews in Contemp 
Pharmacotherpy 1, 1-45). 

The synthesis of PUFAs i.e, fatty acids of 18 carbons or more in length and containing two 
or more double bonds, is thought to be catalyzed in a variety of organisms by a specific fatty 
acid elongase enzyme. This elongase is responsible for the addition of 2 carbon units to an 
18 carbon PUFA, r^ulting in a 20 carbon fatty acid. An example of this reaction is the 
elongation of y-linolenic acid (GLA; 18:3A*'^'^^) to di-homo-y-linoienic acid (DHGLA; 
20:3A*'^ *'*'*) in which the tri-unsaturated 1 8 carbon fatty acid is elongated by the addition of a 
two carbon unit to yield the tri-unsaturated 20 carbon fatty acid. Since there is considerable 
interest in the production of long chain PUFAs of more than 18 carbons in chain length, for 
example arachidonic acid and eicosapentanoic acid, the identification of this enzyme is of 
both academic and commercial interest. 



At present, there are no examples of identified cloned genes encoding PUFA elongases, 
though a number of genes encoding enzymes Kkely to be involved in other aspects of lipid 
synthesis have been identified. For example, an Arabidopsis gene (FAEl) has been shown 
to be required for the synthesis of very long chain monounsaturated fatty acids (such as 
erucic acid; 20:1A'^) (James, D. W. et al, (1995) Plant Cell 7, 309-319). However, it is clear 
that this enzyme does not recognize di- and tri-unsaturated 18 carbon fatty acids, for 
example, linoleic acid, i8:2A^^^^ or a-linolenic acid, 18:3A^'^^**^ respectively, as substrates. 
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and is therefore not involved in the synthesis of long chain PUFAs (Millar & Kunst (1997), 
Plant Journal 12, 121-131). This in itself is not surprising, since, of the plant kingdom, only 
a very few lower plant species, such as the moss Physcomicotrella patens (Girke et al, 
(1998X Plant J, 15: 39-48); are capable of synthesising long chain PUFAs, and therefore 
Arabidopsis would not be expected to contain any such enzymes (Napier et al (1997), 
Biochem J, 328: 71 7-720; Napier et al, (1999) Trends in Plant Sci 4, 2-5). 

A schematic diagram representing a generalized pathway for the product of PUFAs is 
shown in Figure L Biochemical characterisation of mammalian elongation systems (most 
notably from liver microsomes) has indicated that a mammalian elongase consists of four 
subunits, made up of a condensing enzyme, a p-ketoreductase, a dehydrase and an enoyl 
reductase (reviewed in Cinti, D. L,, et al (1992) Prog, Lipid Res. 31, 1-51). The 
Arabidopsis FAEl gene product encodes a polypeptide of 56kDa, which shows very 
limited homology to condensing enzymes such as chalcone synthase and stillbene synthase 
(James, D. W. supra). Although FAEl is normally only expressed in seed tissues, ectopic 
expression in non-seed tissue (or heteroiogously in yeast) revealed that FAEJ could direct 
the synthesis oferucic acid (Millar, A. A., Kunst, L, (1997) Plant J, 12, 121-131). 

Three fatty acid elongase activities have been characterised from the yeast S. cerevisiae. 
Again, this organism does not synthesis PUFAs, and therefore does not contain genes 
encoding a PUFA elongase. One gene ELOl, was identified on the basis of a screen to 
isolate mutants defective in elongation of 14 carbon (i.e. medium) chain saturated fatty 
acids (Toke & Martin (1996) J Biol Chem 271, 18413-18422). Complementation of elol 
mutants restored viability, and the ELOl gene product was shown to encode a polypeptide 
which was responsible for the specific elongation of 14:0 fatty acids to 16:0 fatty acids. 

Two related genes were also detected in the genome of 5. cerevisiae^ and their function 
determined by disruption. These two genes, subsequently named EL02 and EL03, were 
shown to be involved in the elongation of the very long chain saturated fatty acids found in 
sphingolipid molecules (Oh et al (1997), J. Biol Chem 272, 17376-17384), In particular, 
EL02 was required for elongation of fatty acids up to 24 carbons, and EL03 was required 
for elongation of the 24 carbon fatty acid to 26 carbons. However, neither gene was 
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essential for viability. Examination of the these three fatty acid elongases revealed the 
presence of a conserved "histidine box" motif (Shanklin et aL, (1994), Biochemistry^ 33, 
1278742794) (His-X-X-His-His, where X is any amino acid) towards the centre of the 
polypeptide sequences. Importantly, there was no detectable homology between the yeast 
elongases (EL01,23) and the plant very long chain mono-unsaturated fatty acid elongase 
KEY) (Oh etal supra). 

in order to identify genes encoding PUFA elongases, it is necessary to study systems in 
which the synthesis of PUFAs is well documented; a good example of this is the model 
animal system C elegans, a small free-living worm (Tanaka et al, (1996), Lipids 31, 
1173-1178). C elegans, like most other animals, and in contrast to higher plants, 
synthesises PUFAs such as arachidonic acid (AA; 20:4 A^'^'^*'^"*) as precursors to a class of 
molecules known as the eicosanoids, which in turn serve as precursors for compounds such 
as prostaglandins and leucotrienes (Horrobin, (1990), Reviews in Contemp Pharmacotherpy, 
1:1-45). The presence of AA and other long chain polyunsaturated fatty acids in C elegans 
is well documented (Tanaka et aL (1996), Lipids 31. 1 173-1 178). The complete sequence 
of the nematode's genome is now pubUcly available (ITie C elegans consortium, J998, 
Science 282, 2012-2QI8: Database at 

http://www,sangerMCMk/Projects/Cjalgans/blast_serveKshtml), 

An object of the invention is to provide an isolated PUFA elongase. 

Using the above-mentioned C elegans genomic sequence, together with suitable search 
strings, the inventors identified eight related putative open reading frames (ORFs) encoding 
for PUFA elongases. A number of different search criteria were applied to identify a 
number of (ORFs) which were likely to encode polypeptides with fatty acid elongase 
activities. These ORFs were then subject to functional characterisation by heterologous 
expression in ye£^t, allowing the identification of a PUFA elongase. 

Accordingly, a first aspect of the invention provides an isolated polypeptide comprising a 
functional long chain polyunsaturated fatty acid (PUFA) elongase i.e. the polypeptide has 
the ftinction of extending the chain length of an 1 8 carbon PUFA to 20 carbons in length. 
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This polypeptide can be used to elevate PUFA levels in animals, thereby providing a ready 
soiirce of PUFAs. 

The polypeptide may be from a eukaryote. 

The polypeptide may comprise at least a portion of the amino acid shown in SEQ E). 15, or 
variants thereof 

For the purposes of the present application, the term ^Variant" in relation to a certain 
sequence means a protein or polypeptide which is derived from the sequence through the 
insertion or deletion of one or more amino acid residues or the substitution of one or more 
animo acid residues with amino acid residues having similar properties, e.g. the replacement 
of a polar amino acid residue with another polar amino acid residue, or the replacement of a 
non-polar amino acid residue with another non-polar amino acid residue. In all cases, 
variants must have an elongase fimction as defined herein. * 

A second aspect of the invention provides a polypeptide having at least 60 % homology to a 
polypeptide according to a first aspect of the invention. The polypeptide may have at least 
80%, or as much as 90% or more homology to a polypeptide according to a first aspect of 
the invention. 

The polypeptide according to either aspect of the invention may include a sequence motif 
responsible for Endoplasmic Reticulum (ER) - retention. This allows the polypeptide to be 
specifically located or targeted to the ER of a cell 

The polypeptide may also be able to elongate palmitoleic acid (PA; 16:1A^) to vacceric acid 
(VA; 18:1A^^). Thus, the polypeptide is also capable of elongation of a A^- monounsaturated 
16C fatty acid 

Preferably, the polypeptide is from an animal, more preferably, the animal is an invertebrate 
such as a worm. Where the animal is a worm, it is preferably C elegans. Alternatively, the 
animal is a vertebrate, preferably a mammal such as a human, rat or mouse. 
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A third aspect of the invention provides an isolated DNA sequence, preferably a cDNA 
sequence, encoding a polypeptide according to a first or second aspect of the invention. 
This DNA sequence maybe used to engineer transgenic organisms. 

Preferably, the DNA sequence comprises the sequence shown in SEQ ID NO: 7 or variants 
of that sequence due, for example, to base substitutions, deletions, and/or additions. 

A fourth aspect of the invention provides an engineered organism, such as a transgenic 
animal, engineered to express a polypeptide according to a first or second aspect of the 
invention. The engineered organism may be engineered to express elevated levels of the 
polypeptide, thereby providing a supply of polypeptide at a reduced cost as a reduced 
number of organisms need be used. 

Preferably, the engineered organism is a mammal such as a rat, mouse or monkey, 

A fifth aspect of the invention provides an engineered organism containing a synthetic 
pathway for the production of a polypeptide according to a first or second aspect of the 
invention. This has the advantage of allowing greater control over the production of PUFAs 
by the pathway by an organism. 

The pathway may include A^-fatty acid desaturase, and/or A^-fatty acid desaturase. 

The engineered organism according to a fourth or fifth aspect of the invention may be a 
lower eukaryote, such as yeast. Alternatively, the transgenic organism may be a fish. 

A sixth aspect of the invention provides a transgenic plant engineered to express a 
polypeptide according to a first aspect of the invention. 

A seventh aspect of the invention provides a transgenic plant containing a DNA sequence 
according to a third aspect of the invention. 
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An eighth aspect of the invention provides a method of producing a PUFA comprising 
carrying out an elongase reaction catalj^ed by a polypeptide according to a first or second 
aspect of the invention. 

The PUFA may be di-homo-ganmia-linoleic acid (20:3A^'*''*'^), arachidonic acid 
(20:4A'*^'"'''), eicosapentanoic acid (20:5A^^^^"'^''^^), docosatrienoic acid (22:3A^''''^'), 
docosatetraenoic acid (22:4A^'^^'^^'*'), docosapentaenoic acid (22:5A^'^**-^^-^^'^') or 
docosahexaenoic acid (22:6A^^^^'«'^^''* 

The PUFA may be a 24 carbon fatty acid with at least 4 double bonds. 

A ninth aspect of the invention provides a PUFA produced by a method according to an 
eighth aspect of the invention. 

The PUFA maybe used in foodstuffs, dietary supplements or pharmaceutical compositions. 

A tenth aspect of the invention provides a foodstuff comprising a PUFA according to a fifth 
aspect of the invention. The foodstuff can be fed to an animal. 

An eleventh aspect of the invention provides a dietary supplement comprising a PUFA 
according to a fifth aspect of the invaition. The dietary supplement can be supplied to an 
animal to augment its PUFA levels. 

An twelfth aspect of the invention provides a pharmaceutical composition comprising a 
polypeptide according to a first or second aspect of the invention or a PUFA according to a 
ninth aspect of the invention. 

Preferably, the pharmaceutical composition comprises a pharmaceutically-acceptable 
diluent, carrier, excipient or extender. This allows tiie composition to be supplied in a form 
which best suits the pharmaceutical apphcation in question. For example, a topical 
application would preferably be a cream or lotion, whereas if the composition was to be 
ingested a different form would be more suitable. 
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A thirteenth a^ect of the invention provides a method of treatment of an animal, such as a 
mammal, or a plant, comprising supplying to the animal or plant a DNA sequence according 
to a third aspect of the invention, a foodstuff according to a tenth aspect of the invention, a 
dietary supplement according to an eleventh aspect of the invention, a pharmaceutical 
composition according to a twelfth aspect of the invention or a PUFA according to a ninth 
aspect of the invention. 

Preferably, the mammal is a htmian. 

The invention v^ill now be further described, by way of example only, with reference to SEQ 
IDl to 16, and Figures 2 to 1 1, in which; 

SEQ EDI to 8 show the putative ORFs encoding PUFA elongases A to H respectively; and 

SEQ ID9 to 16 show the deduced amino acid sequences of the putative ORFs of SEQ ID 
NO: 1 to 8 respectively; and 

Figures 2 to 9 show hydrophobicity plots for each of PUFA elongases A to H respectively. 

Figure 10 shows an amino acid sequence line-up comparing the C. elegans ORF F56H1 1.4 
(Z68749) with related sequences. 

Figure 1 1 shows chromatograms of fatty acid methyl esters from transformed yeast. 
IntroductiOB to general strategy 

Initially the C. elegans databases were searched for any sequences which showed low levels 
of homology to yeast ELO genes {EL02 and EL03) using the TBLASTN programme. A 
similar search was carried out using short (20 to 50 amino acid) stretches of ELO genes 
which were conserved amongst the three ELO polypeptide sequences. C elegans sequences 
which were identified by this method were then used themselves as search probes, to 
identify any related C elegans genes which the initial search with the yeast sequences failed 
to identify. This was necessary because the level of homology between the yeast ELO genes 
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and anx worm genes is always low (see BLAST scores later). To allow for a more sensitive 
search of worm sequences, a novel approach was adopted to circumvent the major drawback 
with searches using the BLAST programmes, namely that the search string (i.e. the input 
search motif) must be longer than 15 characters for the algorithm to work. Thus, if it was 
desired to search for a short motif (like a histidine box), then the BLAST programme would 
not be capable of doing this. A complete list of all the predicted ORFs present in the C 
elegans genome exists as a database called Wormpep, which is freely available from the 
Sanger WWW site (http://www.sanger.ac.uk/Projects/C__elegans/webace_front_end.shtml), 
The latest version of Wormpep was down loaded to the hard disc of a Pentium PC, and 
re-formatted as a Microsoft Word6 document, resulting in a document of about 3,500 pages. 
This was then searched using the "Search & Replace" function of Word6, which also allows 
for the introduction of **wildcard" characters into the search motif So, for example, it is 
possible to search both for the short text string HPGG, which would identify any predicted 
worm ORF present in the Wormpep 3,500 page document containing this motif, or 
alternatively search with HPGX (where X is a wild card character). Clearly, such (manual) 
searches of a 3,500 page document are extremely time-consuming and demanding, also 
requiring visual inspection of each and every identified ORF. For example, searching with a 
motif such as HXXHH identifies in excess of 300 different ORFs. However, by using a 
number of different short search strings (as outlined below), and combining these with other 
methods for identifying putative elongase enzymes, a number of candidate ORFs have been 
identified. 

Database search using the FAEl polypeptide sequence 

As a negative contml, to demonstrate that the FAEl gene sequence was unlikely to provide a 
useful search sequence in the identification of Celegans sequences encoding for PUFA 
elongases, the GenBank databases (http://www,ncbi.nlm.nih.gov/Web/Search/index.html) 
were searched using the Arabidopis FAEl polypeptide sequence to identify related genes or 
expressed sequence transcripts (ESTs), GenBank is the NIH genetic sequence database, an 
annotated collection of all publicly available DNA sequences {Nucleic Acid Research (1 998) 
26, 1-7). There are approximately 2,162,000,000 b^es in 3,044,000 sequence records as of 
December 1998, The search was carried out using the BLAST2 (Basic Local Aligmnent 
Search Tool) algorithm (Altschul et aU (1990) JMol Biol 215,403,410) Although a number 
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of plant ORFs and ESTs were reported as being related, no animal sequences were identified 
by this search, confirming the observation that FAEl was unlikely to be a suitable candidate 
as a search template for PUFA elongases. 

Database search using yeast ELO sequences 

Using the three yeast fatty acid elongase sequences (ELO 1, 2, 3) as probes, a number of 
putative ORFs in the DNA of C elegans-dcnv^d cosmid sequences which form the C 
elegans genomic sequence database were identified. Moreover, an extensive and 
time-consuming search of a downloaded copy of the WormPep database 
(ffcp://ftp.sanger,ac.ukypub/databases/wormpep) using manual search strings in MSWord 6, 
identified a number of C elegans ORFs which contained presumptive histidine boxes. 
Wormpep contains predicted proteins from the Caenorhabditis elegans genome sequence 
project, which is carried out jointly by the Sanger Centre in Cambridge, UK and Genome 
Sequencing Center in St. Louis, USA. The current Wormpep database, Wormpep 16, 
contains 16,332 protein sequences (7,120,115 residues). Search strings used included 
PIXXHH], [HXXXHH], [QXXHH] and [YHH]. Comparison of the data fi-om the two 
different searches indicated a small (<10) number of putative ORFs as candidate elongases. 
The histidine box motifs are shown in bold in SEQ ID 9 to 16. 

Hydropbobicity plot analysis 

Since the fatty acid elongase reaction is predicted to be carried out on the cytosolic face of 
the endomembrane system (Toke & Martin (1996), supra; Oh et al (1997), supra), the 
putative C. elegans ORFs were examined for potential membrane spanning domains, via 
Kyte & Dooiittle hydropbobicity plots (X MolBiol (1982), 157, 105-132). This revealed a 
number of ORFs with possible membrane-spanning domains, and also indicated a degree of 
similarity in the secondary-structure of a number of identified ORFs. 

Screening for ER-retention signal sequences 

The inventors postulated that since fatty acid elongases are expected to be endoplasmic 
reticulum (ER) membrane proteins, they might be expected to have peptide signals which 
are responsible for "ER-retention". In the case of ER membrane proteins, this signal often 
takes the form of a C-tenminal motif [K-K-X2-rStop], or similar variants thereof (Jackson et 
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aL (1990), EMBO 9, 3153-3162). Further sequence analysis of the C. elegam putative 
elongases revealed that 4 ORFs (F41Hia7, F41H10.8, F56H11A Y53F4Bx) had 
C-terminal motifs that exactly matched this search pattern, and that a further 2 ORFs 
(F11E6.5, C40HL4) had related sequences. These sequence motifs are underlined in SEQ 
ID9tol3,15andl6. 

Chromosome mapping 

Since the inventoi^ had previously observed that Celegans genes involved in the synthesis 
of PUFA may exist in tandem (for example the A5 and A6 desaturases required for AA and 
GLA synthesis, respectively, are < 1 IdB apart on chromosome IV (Michaelson et al, (1998), 
FEBS Letts 439, 215-218), the positions of the putative C elegans elongase ORFs were 
determined using the Sanger Centre's WebAce C elegans server 
(http://www.sanger.ac,uk/Projects/C_eIegans/webace_j5ont_end,shtml).. This indicated that 
two pairs of putative elongases were in close proximity to each other on the C elegans 
chromosome IV. 

F41H10.7 and F41H10.8 were identified as being approximately 10 Kb apart on 
chromosome IV, and F56H1 1 3 and F56H1 1 A were identified as being approximately 2 Kb 
apart on chromosome TV. 

Putative C elegans fatty add elongases 

The positions of the putative ORFs in the C elegans genome are shown below i.e. 
chromosome number, and map position in centiMorgans, together with the GenBank 
database accession numbers. 

The designations used employ the same method as used on the Sanger Centre's C elegans 
database, i.e. ORF C40HL4 is predicted coding sequence 4 on cosmid C40HL 

Elongase Cosmid Sanger ID GenBank Acc Chromosome 

Code 



C40HL4 



Z19154 in 
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B D2024.3 U41011 IV, 7.68 

C F11E6.5 Z81058 IV, 18.8 

D F41H10.7* U61954 IV, 29.8 

E F41H10.8* U61954 IV, 29.8 

F F56H11.3* Z68749 rv,2.5 

G F56H11.4* Z68749 IV, 2.5 

H Y53F4B.C Z92860 E 



* or* indicates genes in tandem 

Comparison of C. elegans putative eloogase ORFs with yeast genes: 

Each of the three yeast ELO polypeptides were compared against all of the woim putative 
elongase translated ORF sequences, and then ranked in order of similarity (as measured by 
the BLAST score) (Altschul et al (1990), supra) 

The results are shown below, with the ORF sequences ranked from most similar to least 
similar, and the BLAST scores are shown in brackets; 

Yeast ELOl ( 1 4 to 1 6 carbon fatty add elongase) 

G (262) > E (241) > D (225) > C (219) > A (216) > F (21 5) > H (197) > B (1 72) 

Yeast EL02 (24 carbon sphingolipid elongase) 



E(231)>C(226)>G(189)>A(I81)>F(166)>D(150)>H(141)>B(140) 
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Yeast EL03 (24 to 26 sphingolipid elongase) 

D (171) > G (163) > F (154) > A (152) > E (150) > C (131) > B (132) > H (128) 

It is clear from the numeric values of the BLAST scores fliat the sequences are related, but 
the levels of homology are low. For comparison, the BLAST score for homology between 
two related worm proteins, the A5 and the A6 desaturase is in excess of 500. 

Analysis of potential sphingolipid ancestry 

Previously, the inventors had noted the similarities between the fatty acid A6 desaturase and 
sphingolipid desaturases in plants, and that the two distinct enzymes could have arisen from 
one ancestral gene. Moreover, it was considered likely that the sphingolipid desaturase 
predated the fatty acid desaturase, and may in fact have been the ancesteral progenitor. 
Therefore it is plausible that the next step in the arachidonic acid biosynthetic pathway has 
also evolved from the sphingolipid metaboKc pathway. It is therefore considered highly 
significant that some of the C. elegans ORF putative elongases have similarity to 
sphingoKpid enzymes. For this reason, these ORFs are considered to be very clear 
candidates for PUFA elongases. It has previously been considered that the C elegans AS 
and A6 fatty acid desaturases have evolved from 1 ancestral gene (Michaelson et al, (1 998), 
FEES Letts 439, 215-218). It is also significant that one pair of C elegans putative 
elongase ORFs (F & G) genetically maps close to the A5/A6 fatty acid desaturase genes, 
with both gene pairs being located at the top end of chromosome IV. 

Cosmid Sanger ID GenBank Acc Chromosome Encoded Peptide 
Code 

W08D2.4 Z70271 IV, 3.06 A6 fatty acid desaturase 

T13F2.1 Z81 122 IV, 3.06 A5 fatty acid desaturase 
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Cloning of Desaturase and Elongase Genes in Yeast Expression Vectors 
Putative elongases sequences F56H11.4 and F41H10.8 were cloned by PCR into the 
pYES2 vector (Invitrogen). A C elegans mixed stage cDNA library was used as a PCR 
template. F56H1 L4 was amplified using primers: 

56hl 1 4.for 5^-GCGGGTACCATGGCTCAGCATCCGCTC-3 ' and; 

56hl i4.rev 5^GCGGGATCCTTAGTTGTTCTTCTTCTT-3\ 



F41H10.8 was amplified using primers: 

41hl08.for 5'-GCGGGTACCATGCCACAGGGAGAAGTC-3' and; 
41hl08,rev 5^GCGGGATCCTTATTCAATTTTTCTTTT^3'. 



Amplified sequences were then restricted using Kpnl and BamHI (underlined in the 
forward and reverse primers, respectively), purified using the Qiagen PCR purification kit, 
and ligated into a KpnI/BamHI cut pYes2 vector. 

An OFF encoding the Mortierella alpina A^-fatty acid desaturase (Michaelson, L. V., et al 
(1998)/ Biol Ckem. 273, 19055-19059) was amplified using primers: 

MadS.for 5^-G CGAATTCA CCATGGGTACGGACCAAGGA-3^ and; 

Mad5.rev 5'-GCGGAGCTCCTACTCTTCCTTGGGACG-3', 

and restricted using EcoRl and Sad, gel purified as described and ligated into a EcoRl/Sacl 
cut pESC-TRP vector (Stratagene) to generate pESC/A'. 

An ORF encoding the borage A^-fatty acid desaturase (Sayanova, O,, et al (1997) Proa 
Nail Acad Sci USA 94, 4211-4216) was restricted firom pGEM3 using BamHI and Xhol 
and ligated into a BamWXhol cut pESC-TRP vector to generate pESC/A^ 
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A double construct was also generated by ligating the BamHVXhol borage insert into the 
pESC/A^ construct described previously, generating pESC/(A^A^). 

Functional Characterisation in Yeast 

Elongases and desatnrase constructs were introduced in Saccharomyces cerevisiae 
W303-1 A using a lithium acetate based method (Elble, R. (1992) Biotechniques 13, 18-20) 
and expression of the transgenes was induced by addition of galactose to 2% (w/v) as 
described in Napier et al (Napier, J. A., et al (1998) Biochem J330, 61 1-614; Michaelson 
L. v., supra\ Michaelson, L. V., (1998) FEBS Letts 439, 215-218), Yeast transformants 
containing pYES2-derived constructs were grown on synthetic minimal media (SD, the 
composition of which is defined in Sherman, F {\99\) Methods in Enzymology 194, 3-21); 
synthetic minimal medium minus uracil; pESC-derived constructs were grown on SD 
minimal medium minus tryptophan. Co-transformed yeast (containing both pYES2 and 
pESC derivatives) were grown on SD minimal medium minus uracil and tryptophan. Prior 
to induction, cultures were grown in the presence of 2% raffinose and supplemented with 
0.5 mM of the appropriate fatty acid substrate in the presence of 1% tergitol-(NP40) 
(Sigma). All cultures were then grown for a further 48-h unless indicated. 

Fatty Acid Analysis 

To identify the elongation reaction responsible for the synthesis of di-homo-y-linolenic acid 
(DHGLA; 20:3 A^-^^-^"*) from GLA, this latter fatty acid was supplied as the (exogenous) 
substrate. 

Lipids were extracted from transformed and control yeast by homogenisation in 
MeOH-CHCls using a modification of the method of Bligh and Dyer (Dickenson & Lester 
(1999) Biochim Biophys Acta 1426, 347-357). The resulting CHCU phase was evaporated 
to dryness under nitrogen gas and the samples were transmethylated with IM HCl in 
methanol at 80 for I hour. Fatty acid methyl esters (FAMES) were extracted in hexane 
and purified using a small column packed with Florisil. Analysis of FAMES was 
conducted using a Hewlett Packard 5880A Series Gas Chromatograph equipped with a 
25M X 0.32mm RSL-500BP bonded capiHiary column and a flame ionisation detector. 
Fatty acids were identified by comparison of retention times with FAME standards (Sigma) 
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separated on the same GC, Quantitation was carried out using peak height area integrals 
expressed as a total of all integrals (Bligh, E.G. & Dyer, WJ. (1959) Can. J, Biochem. 
Physiol 31,911-917), 

Total fatty acids extracted irom yeast cultures were analysed by gas chromatography (GC) 
of methyl ester derivatives. Lipids were extracted, transmethylated and the fatty acid 
methyl est&rs (FAMEs) analysed as described by Sayanova et al 

Figure 1 1 shows chromatograms of fatty acid methyl esters from yeast transformed with the 
control (empty) plasmid pYES2 (Fig. 1 lA) or with ORF F56H1L4 in pYES2 (Fig. 1 IB). 
Exogenous substrate in the form of GLA was supplied to the cultures. Two novel peaks 
are observed in (B); these peaks (annotated as 20:3 and 18:1*) were identified (against 
known standards) as DHGLA and vaccenic acid, respectively. Detection was by flame 
ionisation. 

One cDNA ORF tested in this manner displayed a high level of elongase activity on the 
GLA substrate, converting 44% to DHGLA. The identity of this elongation product was 
confirmed as DHGLA by comparison with a known standard (the standards used were 
known standards for either DHGLA, AA, EPA or VA from Sigma Chemicals, Ltd.), using 
GCMS analysis using a Kratos MS80RFA (Napier, L A., supra; Michaelson, L. V,, supra; 
Michaelson, L. V., supra). The deduced amino acid sequence of the functional elongase 
clone identified it as being encoded by the C. elegans gene F56H1 1 .4, and comparison with 
the yeast ELO genes showed low homology confined to a few short amino acid motifs (see 
Fig. 10). Some similarity with a mouse gene Cig30 (Tvrdik, P., (1997) / BioL Chem, 111, 
31738-31746), which has been implicated in the recruitment of brown adipose tissue in 
liver tissue, was also observed, as well as a potential human homoiogue encoded by a gene 
located on chromosome 4q25, BAG 207d4. The most closely related C. elegans ORFs, 
F41H10.8 0J61954) and F56H1L3 (Z68749) are also shown, as is part of a related human 
gene present on chromosome IV (present on BAG clone B207d4; AC004050). The 
GenBank accession numbers are given for all sequences. 

The range of fatty acids synthesised by C elegans can potentially require a number of 
different elongation reactions (Tanaka, T., (1996) Lipids 31, 1173-1178). The 
substrate-specificity of the F56H11.4 PUFA elongase was therefore determined using a 
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range of exogenously supplied fatty acids. This revealed that GLA is the major substrate, 
with a number of other fatty acids being elongated at a lower efficiency (see Table 1). 
Although most of these substrates are polyunsaturated fatty acids, it was xmexpectedly 
observed that palmitoleoic. acid (PA; 16:1 A^) was also elongated by F56H1L4 to yield 
vaccenic acid (VA; 18:1 A*^). The biosynthetic pathway for VA is unclear, but the data 
indicate that it may be synthesised by elongation of A^-monounsaturated 16C fatty acid. 

The C elegans PUFA elongase ORF F56H1 1 .4 maps to the top of chromosome IV (at 432 
cM) with a related sequence (F56H113; 51 % similarity) located l,824bp downstream. 
Another C elegans gene (F41H10.8) was also observed, which is present on chromosome 
IV, and which shows a slightly higher level (53%) of similarity to the PUFA elongase than 
F56H1L3 (see Fig. 10). However, when a PGR product encoding ORF F41Hia8 was 
expressed in yeast in a manner identical to that used for F56H11.4, the former failed to 
direct the elongation of any fatty acids, despite the provision of a range of substrates (see 
Table II). 

In order to reconstitute the PUFA biosynthetic pathway in a heterologous system, the 
PUFA elongase F56H11.4 was expressed in yeast in conjunction with either the A^- or 
A^-fatty acid desaturases previously isolated and characterised by the inventor (Napier, J. 
A., supra; Michaelson, L, V., supra). Expression of the A^-fatty acid desaturase and 
F56H1 1.4 was carried out in the presence of two different substrates (LA or ALA) while 
the A^-fatty acid desaturase and the elongase were expressed in the presence of GLA only. 
This demonstrated that was possible to combine a desaturase and an elongase in yeast to 
generate significant amounts of a final "product'* (see Table III). In the case of the 
elongase and the A^-fatty acid desaturase, the reactions proved highly efficient with the 
production of 4.5% of DHGLA from the LA substrate. This resulted from 25% 
desaturation of the LA substrate to GLA, which was then elongated to DHGLA at a similar 
level of efficiency (18%). This is lower than the % conversion observed for GLA when 
supplied exogenously (see Table I), indicating that the in vivo production of substrates for 
elongation may be rate-limiting. 
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If ALA was used as a substrate, 27% of this was initially A^-desaturated to yield 
octadecatetraenoic acid (OTA; 18:4 A**^^^^^^^) but only 8% of was subsequently elongated to 
yield eicosatetraenoic acid (20:4 A^^^i'*^^^^), Thus, the conversion efficiency of ALA to the 
final 20-carbon tetraenoic PUFA was only about 2.2%. 

Since DHGLA is an n-d fatty acid, whilst the OTA-derived eicostetraenoic acid is an n-3 
type, this demonstrates that the elongase is capable of accepting both forms of essential 
fatty acid, albeit with different efficiencies. Verification was also provided that the 20C 
PUFAs synthesised in the yeast expression system were generated by the AMesaturation of 
18C substrates which were subsequently elongated, as the A^-desaturase showed no activity 
on 20:2 or 20:3 substrates (see Table III), 

The combination of the A^-desaturase and the elongase also demonstrated that these two 
enzymes could work in tandem, although the efficiency of this overall conversion was 
lower (3.3% AA fi-om GLA) which was due to the previously observed low activity of the 
AMesaturase enzyme itself (Michaelson, L. V., supra; Michaelson, L. V., supra). Thus, 
although nearly 45% of the GLA substrate was elongated to DHGLA, only 7.5% of this 
was then desaturated to AA (see Table HI). 

Finally, the production of either AA or eicosapentanoic acid (EPA; 20:5A^-^'^^'''''*^) in yeast 
firom dienoic or trienoic 18 carbon substrates was achieved via expression of all three 
enzymes (the two desaturases and the F56H11.4 PUFA elongase) simultaneously. As 
shown in Table IV, small but significant amounts of AA were produced when the yeast was 
supplied with the 18C dienoic fatty acid LA. 

GC-Mass Spectroscopy (MS) Analysis 

Peak identification and confirmation were carried out by GC-MS using a Kratos MS80RFA 
using known standards (Sigma). The identity of this 20C PUFA was verified by GCMS, 
indicating that the conversion eflBciency from LA was 0.65%. When ALA was used as a 
substrate, 12.5% of the (A^-desaturated and elongated) eicosatetraenoic n~3 fatty acid was 
A^-desaturated, resulting in a total conversion of 03% of the ALA substrate to EPA (the 
identity of EPA was confirmed by GCMS), 
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Expression of C elegans elongase in plants 

In order to express C. elegans elongase in plants, the following protocol is an example of a 
process which can be used to create the transgenic plants, C elegans ORF sequence can be 
subcloned into a plant expression vector pJD330, which comprises a viral 35S promoter, 
and a Nos terminator. The resulting cassette or promoter/coding sequence/terminator can 
then be subcloned into the plant binary transformation vector pBin 19, and the resulting 
plasmid introduced into Agrobacterium tumefaciens. This Agrobacterium strain can then be 
used to transform Arabidopsis by the vacuum-infiltration of inflorescences, and the seeds 
harvested and plated onto selective media containing kanamycin. Since pBin 19 confers 
resistance to this antibotic, only transfomied plant material will grow. Resistant lines can 
therefore be identified and self-fatilized to produce homozygous material. Leaf material 
can then be analyzed for expression of C elegans elongase. 

Fatty acid methyl ester analysis can be carried out as previously described. 
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SEQIDI 

C40H1.4 

atggagcttgccgagttctggaatgatctcaacaccttcaccatctacggaccgaatcac 
acagatatgaccacaaaatacaaatattcatatcacttcccaggtgaacaggtggcggat 
ccgcagtattggacgattttattccagaaatattggtatcattcgatcacaatatcagtt 
ctttatttcattttaattaaggtgattcaaaagtttatggagaatcgaaaaccattcact 
ttgaaatacccattgattctttggaatggagctcttgcagcattcagtataattgccaca 
ttgcggttctctattgatcctctacgatcactatatgctgaaggattctacaaaactctg 
tgctattcgtgtaatccaactgatgtggctgcattttggagctttgcattcgctctttcc 
aagattgttgaacttggagacactatgttcattattttgagaaaacggccattgatcttt 
ttacactactatcatcatgcagcagtgttaatctacactgtccattctggtgccgagcat 
actgcagctggtcgtttctacatcctaatgaactacttcgcacattctctcatgtatact 
tactacacagtttctgccatgggatacagattaccgaaatgggtatcaatgactgtcaca 
actgttcaaacaactcaaatgttagctggagtcggaataacttggatggtgtacaaagtg 
aaaactgaatacaagcttccttgtcaacaatccgtagccaatttgtatctcgcattcgtc 
atctatgtcacatttgccattcttttcattcaattcttcgtcaaggcatacattatcaag 
tcgtcgaagaagtcgaaatcggtgaagaacgaataa 



SEQID2 

D2024 .3 

atggcaaaatacgactacaatccgaagtatgggttagaaaattacagcatattccttccc 
tttgagacatcttttgatgcatttcgatcgacaacatggatgcaaaatcactggtatcaa 
tcaattacagcatctgtcgtgtatgtagccgtcatttttacaggaaagaaggtggttctc 
atctacaaaaaatcacgagttattacttttgagtctagccttcagaatgcaattaagaat 
cgaaaccgaaaatcacttaatagttctcaaatgtttcagattatggaaaagtacaagccc 
ttccaactggacacaccactcttcgtctggaattcatttttagccattttctcaattctc 
gggttcctccgaatgacacctgaatttgtatggagttggtcagcagaaggaaactcattc 
aaatattcaatttgtcattcatcttatgctcaaggagtcactggtttctggactgaacaa 
ttcgcaatgagcaaacttttcgagctcatcgacacaatcttcatcgttcttcgtaaacgt 
ccactcatcttccttcactggtatcatcatgtaactgttatgatctacacatggcacgcg 
tacaaggatcacactgcatcaggacggtggttcatttggatgaattatggagttcatgct 
cttatgtattcctactatgctcttcgttctctgaaattccgtcttccaaaacaaatggca 
atggttgttactactctccaacttgctcaaatggttatgggagtaatcatcggagtcact 
gtctaccgtatcaagtcatcgggtgaatactgccaacagacatgggacaatttgggatta 
tgctttggagtttatttcacatatttccttcttttcgccaacttcttctaccatgcatat 
gttaagaaaaacaaccgtacagtaaattatgaaaataattcaaaaaatttccccgatctc 
gttttaatttacctgagaaaaaaggtttcaagaaaatcgaaaaatcggcaatgttcagaa 
aataattataaaattcaattttcatcaaattttgttaatgttgatggaaaaaaacataag 
aaaacatatgaacttattcttccaagaagaaaaatgaccacaattttaacttttctattt 
ggaaaaaatcgaattttttcgaaatatcagaaaaatcgaaaaaacatttcgattcctgtt 
gatttcgaaattctggagccaaaagaagatatcaatgctaacatcgctgagccatccatc 
acaacgaggtccgccgccgcacgaagaaaagttcaaaaagctgattag 
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F11E6,5 

^tggcagcagcacaaacaagtccagcagccacgctcgtcgatgttttgacaaaaccatgg 
agtctggatcagactgattcttacatgtctacatttgtaccattatcctataaaatcatg 
attggttatctcgtcaccatctacttcgggcaaaaattaatggctcacagaaaaccattc 
gatctccaaaatacacttgctctctggaacttcgggttttcactgttctcgggaatcgcc 
gcctataagcttattccagaactattcggagttttcatgaaggacgggtttgtcgcttcc 
tactgtcaaaacgagaactactacaccgatgcatcaactggattctggggctgggccttt 
gtgatgtcgaaagctccagaactaggggatactatgttcttggtccttcgtaaaaaacca 
gttatcttcatgcactggtatcatcatgccctcacatttgtctacgcagtagtcacatac 
tctgagcatcaggcatgggctcgttggtctttggctctcaaccttgccgtccacactgtt 
atgtatttctacttcgccgttcgcgccttgaacatccaaactccacgcccagtggcaaag 
ttcatcactactattcaaattgtccaatttgtcatctcatgctacatttttgggcatttg 
gtattcattaagtctgctgattctgttcctggttgcgctgttagctggaatgtgctatcg 
atcggaggactcatgtacatcagttatttgttcctttttgccaagttcttctacaaggcc 
tacattcaaaaacgctcaccaaccaaaaccagcaagcaggagtag 



SEQ ID4 

F41H10.7 

atgtcatcggacgatcgtggcactagaaccttcaagatgatggatcaaattcttggaaca 
aacttcacttatgaaggtgccaaagaagttgctcgaggccttgaaggtttctcagcaaag 
cttgccgtcggatatattgccactatttttggactgaaatattatatgaaagaccgaaaa 
gccttcgatctcagtactccattaaacatttggaatggtattctttcgacattcagctta 
ttgggattcttattcacttttcctactttgttatcagttatcagaaaggatggatttagt 
cacacctattcccatgtctctgagctttacactgacagtacctctggatattggatcttc 
ctttgggttatctcaaagattccggaacttttggatacagtattcattgttcttcgcaag 
agaccacttattttcatgcactggtaccatcacgcattgaccggttactatgctcttgtc 
tgctaccatgaggatgctgtccatatggtttgggttgtatggatgaattatattattcat 
gcattcatgtatggatactatcttctgaaatctctgaaagttccaattccaccatcagtt 
gctcaagcaatcaccacatctcaaatggttcaattcgcagttgccattttcgcacaagtt 
catgtttcctataaacactatgttgagggagttgaaggattagcctactcgttcagagga 
acagctatcggatttttcatgcttactacctacttctatctatggattcaattctacaaa 
gagcactatcttaagaatggaggcaaaaagtacaatttggcaaaggatcaggcaaaaact 

caaacaaagaaggctaactaa 



SEQ IDS 

F41H10.8 {ce477) 

atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt 
cagtcatgag ctctcaaaaa agcatattgc acagactcag tatgctgctt 
tctggatctc aatggcatat gttgtcgtta tttttgggct caaggctgtc 
atgacaaacc gaaaaccatt tgatctcacg ggaccactga atctctggaa 
tgcgggtctt gctattttct caactctcgg atcacttgcc actacatttg 
gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac 
atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt 
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tctctcaaaa gttgctgaat tcggagatac actttttatt attcttcgta 
aaaagccatt gatgttcctt cattggtatc atcatgtgct tacaatgaat 
tatgctttta tgtcatttga agctaatttg ggatttaata cttggattac 
atggatgaat ttctcagttc actcaattat gtatggatat tatatgcttc 
gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca 
atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg 
atatttggca gttactggac aatctgttga ctcaactcca ggatattatt 
ggttctgcct tctcatggaa atctcttatg tcgttctgtt cggaaacttc 
tactatcaat catacatcaa gggaggtggc aagaagttta atgcagagaa 
gaagactgaa aagaaaattg aataa 



SEQID6 

F56H11.3 

atgtatttgaattatttcgcgacggaaatcttccatcgtagtgcggtttgtgaaacagaa 
gcttgtcgctcgtcaaaaataatgattgctgacgtgttcaaatggaaattcgatgcaaac 
gaattgtggagtcttttaacgaatcaggatgaagttttcccgcatattagagcacggcga 
ttcattcaagaacattttggtctattcgtccagatggcaattgcatatgtcattttggtg 
ttctcaatcaaaaggttcatgagggatcgtgaaccatttcaactcaccacagctcttcgt 
ctctggaacttcttcctctccgtcttctcaatttatggttcctggacaatgtttccattt 
atggttcaacaaataagactttatggtctctacggatgtggatgcgaagcactttcaaac 
cttccgagtcaagcagaatattggcttttcctgacgatcttgtccaaagctgtggagttt 
gttgatacatttttcttggttctccggaaaaaaccactcatcttcctacactggtatcat 
catatggcaacatttgtcttcttctgcagtaattacccgactccatcgtcacaatcacgc 
gtcggagttatcgtcaacctgttcgtgcatgccttcatgtacccatactatttcacccga 
tcaatgaacatcaaagttcctgcgaaaatttcaatggctgttacagttcttcaattgact 
caattcatgtgctttatctatggatgtactctcatgtactactcgttggccactaatcag 
gcacgatacccctcaaatacacctgcgacactccaatgtttgtcctacactctacatttg 
ctttga 



SEQ ID7 

F56H11,4 (Ce 166) 

atg gctcagcatc cgctcgttca acggcttctc gatgtcaaat tcgacacgaa 
acgatttgtg gctattgcta ctcatgggcc aaagaatttc cctgacgcaga 
aggtcgcaa gttctttgct gatcactttg atgttactat tcaggcttcaa 
tcctgtacat 

ggtcgttgtg ttcggaacaa aatggttcat gcgtaatcgt caaccattcc 
aattgactat tccactcaac atctggaatttcatcctcgc cgcattttcc 

atcgcaggag ctgtcaaaat gaccccagag ttctttggaa ccattgccaa 
caaaggaatt gtcgatcctactgc aaagtgtttg atttcacgaa aggagagaat 
ggatactgggt gtggctctt catggcttcc aaacttttcg aacttgttga 

caccatcttc ttggttctccgtaaacgtcc actcatgttc cttcactggt 

atcaccatat tctcaccatg atctacgcctggtactctca tccattgacc 

ccaggattca acagatacgg aatttatctt aactttgtcg 

tccacgcctt catgtactct tactacttcc ttcgctcgat gaagattcgc gtgc 

caggattcatcgccca agctatcaca tctcttcaaa tcgttcaatt catcatctc 
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t tgcgccgttcttgctcatct tggttatctc atgcacttca ccaat 
gccaactgt gatttcgagc catcagtatt caagctcgca 
gttttcatgg acacaacata cttggctctt ttcgtcaact tcttcctcca 
atcatatgtt 

ctccgcggag gaaaagacaa gtacaaggca gtgccaaaga agaagaacaa ctaa 



SEQ1D8 

Y53F4B.C 

atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga 
gaccatcatc tattccccat tcgagtacga ttccacgttg ctcatcgagt 
catgtcggtg tacttatcag ctgcttatat tattgcgaca aatttattac 
agagatatat ggagtcacgg aaacctaaaa cttttactag catggaacgg 
ttttttggca gtgttcagta ttatgggtac atggagattt ggaatcgaat 
tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgccfcggct 
gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc 
gaaaatcgcc gagtttgggg acacgatgtt cttggtgctg aggaaacggc 
cggttatatt ccttcactgg tatcatcacg ctgttgttct gatcctttct 
tggcatgctg caatcgaact cacagctcca ggacgctggt ttatttttat 
gaactatttg gtgcattcaa taatgtatac atactacgca ataacatcaa 
tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa 
actcttcaaa tgctcattgg tgtcagcatt tcttgcattg tgctttattt 
gaagcttaat ggagagatgt gccaacaatc ctacgacaat ctggcgttga 
gcttcggaat ctacgcctca ttcctggtgc tattctccag tttcttcaac 
aatgcatatt tggtaaaaaa ggacaagaaa cccgatgtga agaaggatta 
a 



SEQ ID9 
A 

1 MELAEFWNDL NTFTIYGPNH TDMTTKYKYS YHFPGEQVAD PQTE^TILFQK 

51 YWYHSITISV LYFILIKVIQ KFMENRKPFT LKYPLILWNG ALAAFSIIAT 

101 LRFSIDPLRS LYAEGFYKTL CYSCNPTDVA AFWSFAFALS KIVELGDTMF 

151 IILRKRPLIF LHYYHHAAVL lYTVHSGAEH TAAGRFYILM NYFAHSLMYT 

201 YYTVSAMGYR LPKWVSMTVT TVQTTQMLAG VGITWMVYKV KTEYKLPCQQ 

251 SVANLYLAFV lYVTFAILFI QFFVKAYIIK SSKKSKS VKN E* 



SEQ IDIO 
B 

1 MAKYDYNPKY GLENYSIFLP FETSFDAFRS TTWMQNHWYQ SITASWYVA 

51 VIFTGKKWL lYKKSRVITF ESSLQNAIKN RNRKSLNSSQ MFQIMEKYKP 

101 FQLDTPLFVW NSFLAIFSIL GFLRMTPEFV WSWSAEGNSF KYSICHSSYA 

151 QGVTGFWTEQ FAMSKLFELI DTI FIVLRKR PLIFLHWYHH VTVMIYTWHA 

201 YKDHTASGRW FIWMNYGVHA LMYSYYALRS LKFRLPKQMA MWTTLQLAQ 



'W000f55330 



26 



PCT/GBOO/01035 



251 MVMGVIIGVT VYRIKSSGEY CQQTWDNLGL CFGVYFTYFL LFANFFYHAY 

301 VKKNNRTVNY ENNSKNFPDL VLIYLRKKVS RKSKNRQCSB NNYKIQFSSN 

351 FVNVDGKKHK KTYELILPRR KMTTILTFLF GKNRIFSKYQ KNRKNISIPV 

401 DFEILEPKED INANIAEPSI TTRSAAARRK V OKAD* 



SEQ IDll 
C 

1 MAAAQTSPAA TLVDVLTKPW SLDQTDSYMS TFVPLSYKIM IGYLVTIYFG 

51 QKIiMAHRKPF DLQNTLALWN FGFSLFSGIA AYKLIPELFG VFMKDGFVAS 

101 YCQNENYYTD ASTGFWGWAF VMSKAPELGD TMFLVLRKKP VIFMHWYHHA 

151 LTFVYAWTY SEHQAWARWS LALHLAVHTV MYFYFAVRAL NIQTPRPVAK 

201 FITTIQIVQF VISCYIFGHL VFIKSADSVP GCAVSWNVLS IGGLMYISYL 

251 FLFAKFFYKA YIQKRSPTKT SKQE* 



SEQID12 



1 MSSDDRGTRT FKMMDQILGT NFTYEGAKEV ARGLEGFSAK LAVGYIATIF 

51 GLKYYMKDRK AFDLSTPLNI WNGILSTFSL LGFLFTFPTL LSVIRKDGFS 

101 HTYSHVSELY TDSTSGYWIF LWVISKIPEL LDTVFIVLRK RPLIFMHWYH 

151 HALTGYYALV CYHEDAVHMV WWWMNYIIH AFMYGYYLLK SLKVPIPPSV 

201 AQAITTSQMV QFAVAIFAQV HVSYKHYVEG VEGLAYSFRG TAIGFFMLTT 

251 YFYLWIQFYK SHYLKNGGKK YNLAKDQAKT Q TKKAN* 



SEQ ID13 

E 

MPQGEVSFFE VLTTAPFSHE LSKKHIAQTQ YAAFWISMAY WVIFGLKAV 
MTNRKPFDLT GPLNLWNAGL AIFSTLGSLA TTFGLLHEFF SRGFFESYIH 
IGDFYNGLSG MFTWLFVLSK VAEFGDTLFI ILRKKPLMFL HWYHKVLTMH 
YAFMSFEANL GFNTWITWMN FSVHSIMYGY YMLRSFGVKV PAWIAKNITT 
MQILQFVITH FILFHVGYLA VTGQSVDSTP GYYWFCLLME ISYWLFGNF 
YYQSYIKGGG KKFNAEKKTE KKIE* 



SEQ ID14 
F 

1 MYLNYFATEI FHRSAVCETE ACRSSKIMIA DVFKWKFDAN ELWSLLTNQD 
51 EVFPHIRARR FIQEHFGLFV QMAIAYVILV FSIKRFMRDR SPFQLTTALR 
101 LWNFFLSVFS lYGSWTMFPF MVQQIRLYGL YGCGCEALSN LPSQAEYWLF 
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151 LTILSKAVEF VDTFFLVLRK KPLIFLHWYH HMATFVFFCS NYPTPSSQSR 
201 VGVIVNLFVH AFMYPYYFTR SMNIKVPAKI SMAVTVLQLT QFMCFIYGCT 
251 LMYYSLATNQ ARYPSNTPAT LQCLSYTLHL L* ^ 



SEQ ID15 

G 

MAQHPLVQRL LDVKFDTKRF VAIATHGPKN FPDAEGRKFF ADHFDVTIQA 
SILYMVWFG TKWFMRNRQP FQLTIPLNIW NFILAAFSIA GAVKMTPEFF 
GTIANKGIVA SYCKVFDFTK GENGYWVWLF MASKLFELVD TIFLVLRKRP 
LMFLHWYHHI LTMIYAWYSH PLTPGFNRYG lYLNFWHAF MYSYYFLRSM 
KIRVPGFIAQ AITSLQIVQF IISCAVLAHL GYLMHFTNAN CDFEPSVFKL 
AVFMDTTYLA LFVNFFLQSY VLRGGKDKYK AVP KKKNN* 



SEQ ID16 
H 

MSAEVSERFKVWTGNNETIIYSPFEYDSTLLIESCRCTYQLLILLRQI 
YYRDIWSHGNLKACDXLLLAWNGFLAVFSIMGTWRFGIEFYDAVFRXG 
FIXS I CLAWPRSP SAFWACMFALSKI AEFGDTMFLVLRKRPVI FLHWYHH 
AWLILSWHAAIELTAPGRWFIFMNYLVHSIMYTYYAITSIGYRXPKIVSMT 
VTFLQTLQMLIGVS I SCI VLYLKLNGEMCQQSYDNLALSFGI YASFLVLSSFF 
NNAYLVKKDKKPDVKKD* 
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1. An isolated polypeptide comprising a functional long chain polyunsaturated fatty acid 
(PUFA) elongase as herein defined. 

I. A polypeptide according to claim 1 wherein the polypeptide is from a eukaryote. 

3. A polypeptide according to claim 1 or claim 2 wherein the polypeptide has at least a 
portion of the amino acid sequence shown in SEQ ID 15, or variants thereof 

4. A polypeptide having at least 60% homology to a polypeptide according to claim 3 and 
having a PUFA elongase function. 

5. A polypeptide according to claim 4 having at least 80% homology. 

6. A polypeptide according to claim 5 having at least 90% homology. 

7. A polypeptide according to any preceding claim wherein the polypeptide sequence 
includes a sequence motif responsible for Endoplasmic Reticulum (ER) - retention, 

8. A polypeptide according to any preceding claim wherein the polypeptide is capable of 
elongating palmitoleic acid (PA; 16:1A^) to vacceric acid (VA; 18:1A^*). 

9. A polypeptide according to any preceding claim wherein the polypeptide is from an 
animal 

10. A polypeptide according to claim 9 wherein the animal is an invertebrate. 

I I . A polypeptide according to claim 1 0 wherein the invertebrate is a worm. 
12. A polypeptide according to claim 1 1 wherein the worm is C elegans. 
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13. A polypeptide according to claim 9 wherein the animal is a vertebrate. 

14. A polypeptide according to claim 13 wherein the vertebrate is a mammal 

15. A polypeptide according to claim 14 wherein the mammal is a himian, rat or mouse. 

16. A DNA sequence encoding a polypeptide according to any preceding claim. 

17. A DNA sequence according to claim 16 wherein the DNA comprises the sequence 
shown in SEQ ID 7 or variants of that sequence due to base substitutions, deletions 
and/or additions. 

18. An engineered organism engineered to express a polypeptide according to any one of 
claims 1 to 15, 

19. An engineered organism according to claim 18 wherein the animal is a mammal. 

20. An engineered organism according to claim 19 wherein the mammal is a rat, mouse or 
monkey. 

21. An engineered organism containing a synthetic pathway for the production of a 
polypeptide according to any one of claims 1 to 1 5, 

22. An engineered organism according to claim 21 wherein the pathway includes A^-fatty 
acid desaturase. 

23. An engineered organism according to claim 21 or 22 wherein the pathway includes 
A*-fatty acid desaturase. 

24. An engineered organism according to any one of claims 21 to 23 wherein the animal is 
a lower eukaryote. 
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25. An engineered organism according to claim 24 wherein the lower eukaryote is a yeast. 

26. An engineered organism according to claim 18 wherein the animal is a fish. 

27. A transgenic plant engineered to express a polypeptide according to any one of claims 1 
to 15. 

28. A transgenic plant containing a DNA sequence according to claim 16 or 1 7. 

29. A method of producing a PUFA comprising carrying out an elongase reaction catalysed 
by a polypeptide according to any one of claims 1 to 1 5. 

30. A method according to claim 29 wherein the PUFA is di-homo-gamma-linoleic acid 
(20:3A'^^*^*'), arachidonic acid (20:4A''''*^^^'), eicosapentanoic acid (20:5A''''''^*''''}, 
docosatrienoic acid {22:3A^'^^*^'), docosatetraenoic acid (22:4A^'^^^^^-^'^), docosapentaenoic 
acid (22:5A''^'-*^^'^'^^ or docosahexaenoic acid (22:6A''''^'^^''^'^^'). 

31. A method according to claim 29 wherein the PUFA is a 24 carbon fatty acid with at least 
4 double bonds. 

32. A PUFA produced by a method according to any one of claims 29 to 3 1 . 

33. A foodstuff comprising a PUFA according to claim 32. 

34. A dietary supplement comprising a PUFA according to claim 32. 

35. A pharmaceutical composition comprising a polypeptide according to any one of claims 
1 to 15. 



36. A pharmaceutical composition comprising a PUFA according to claim 32. 
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37. A pharmaceutical composition according to claim 35 or claim 36 wherein the 
composition comprises a pharmaceutically-acceptable diluent, carrier, excipient or 
extender. 

38. A method of elevating the PUFA levels of an animal or a plant by supplying to the 
animal or plant a polypeptide according to any of claims 1 to 15, a DNA sequence 
according to claim 16 or claim 17, a foodstuff according to claim 33, a dietary 
supplement according to claim 34, a pharmaceutical composition according to any of 
claims 35 to 37 or a PUFA according to claim 32. 

39. A method of treatment according to claim 38 wherein the animal is a mammal 

40. A method of treatment according to claim 39 wherein the mammal is a human. 
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SEQUENCE LISTING 



<110> Napier, Johnathan A. 

<120> Polyunsaturated Fatty Acid (PUFA) 

<130> 76/7 

<140> PCT/GBOO/01035 
<141> 2000-03-20 

<160> 22 

<170> Patentln Ver. 2.1 

<210> 1 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 1 

gcgggtacca tggctcagca tccgctc 



<210> 2 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 2 

gcgggatcct tagttgttct tcttctt 



<210> 3 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 3 

gcgggtacca tgccacaggg agaagtc 



Elongase from Caenorhabditis elegans 



27 



27 



<210> 4 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 4 

gcgggatcct tattcaattt ttctttt 



<210> 5 

<211> 29 

<212> DNA 

<213> C. elegans 



1 



<400> 5 

gcgaattcac catgggtacg gaccaagga 



29 



<210> 6 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 6 

gcggagctcc tactcttcct tgggacg 



<210> 7 
<211> 87 6 
<212> DNA 
<213> C. elegans 

<400> 7 

atggagcttg ccgagttctg gaatgatctc 
acagatatga ccacaaaata caaatattca 
ccgcagtatt ggacgatttt attccagaaa 
ctttatttca ttttaattaa ggtgattcaa 
ttgaaatacc cattgattct ttggaatgga 
ttgcggttct ctattgatcc tctacgatca 
tgctattcgt gtaatccaac tgatgtggct 
aagattgttg aacttggaga cactatgttc 
ttacactact atcatcatgc agcagtgtta 
aqtgcagctg gtcgtttcta catcctaatg 
tactacacag tttctgccat gggatacaga 
actgttcaaa caactcaaat gttagctgga 
aaaactgaat acaagcttcc ttgtcaacaa 
atctatgtca catttgccat tcttttcatt 
tcgtcgaaga agtcgaaatc ggtgaagaac 



aacaccttca ccatctacgg accgaatcac 60 
tatcacttcc caggtgaaca ggtggcggat 120 
tattggtatc attcgatcac aatatcagtt 180 
aagtttatgg agaatcgaaa accattcact 240 
gctcttgcag cattcagtat aattgccaca 300 
ctatatgctg aaggattcta caaaactctg 360 
gcattttgga gctttgcatt cgctctttcc 420 
attattttga gaaaacggcc attgatcttt 480 
atctacactg tccattctgg tgccgagcat 540 
aactacttcg cacattctct catgtatact 600 
ttaccgaaat gggtatcaat gactgtcaca 660 
gtcggaataa cttggatggt gtacaaagtg 720 
tccgtagcca atttgtatct cgcattcgtc 780 
caattcttcg tcaaggcata cattatcaag 840 
gaataa 



<210> 8 
<211> 1308 
<212> DNA 
<213> C. elegans 

<400> 8 

atggcaaaat acgactacaa tccgaagtat 
tttgagacat cttttgatgc atttcgatcg 
tcaattacag catctgtcgt gtatgtagcc 
atctacaaaa aatcacgagt tattactttt 
cgaaaccgaa aatcacttaa tagttctcaa 
ttccaactgg acacaccact cttcgtctgg 
gggttcctcc gaatgacacc tgaatttgta 
aaatattcaa tttgtcattc atcttatgct 
ttcgcaatga gcaaactttt cgagctcatc 
ccactcatct tccttcactg gtatcatcat 
tacaaggatc acactgcatc aggacggtgg 
cttatgtatt cctactatgc tcttcgttct 



gggttagaaa attacagcat attccttccc 60 
acaacatgga tgcaaaatca ctggtatcaa 120 
gtcattttta caggaaagaa ggtggttctc 180 
gagtctagcc ttc'agaatgc aattaagaat 240 
atgtttcaga ttatggaaaa gtacaagccc 300 
aattcatttt tagccatttt ctcaattctc 360 
tggagttggt cagcagaagg aaactcattc 420 
caaggagtca ctggtttctg gactgaacaa 480 
gacacaatct tcatcgttct tcgtaaacgt 540 
gtaactgtta tgatctacac atggcacgcg 600 
ttcatttgga tgaattatgg agttcatgct 660 
ctgaaattcc gtcttccaaa acaaatggca 720 



atggttgtta ctactctcca acttgctcaa 
gtctaccgta tcaagtcatc gggtgaatac 
tgctttggag tttatttcac atatttcctt 
gttaagaaaa acaaccgtac agtaaattat 
gttttaattt acctgagaaa aaaggtttca 
aataattata aaattcaatt ttcatcaaat 
aaaacatatg aacttattct tccaagaaga 
ggaaaaaatc gaattttttc gaaatatcag 
gatttcgaaa ttctggagcc aaaagaagat 
acaacgaggt ccgccgccgc acgaagaaaa 



atggttatgg gagtaatcat cggagtcact 7 80 
tgccaacaga catgggacaa tttgggatta 840 
cttttcgcca acttcttcta ccatgcatat 900 
gaaaataatt caaaaaattt ccccgatctc 960 
agaaaatcga aaaatcggca atgttcagaa 1020 
tttgttaatg ttgatggaaa aaaacataag 1080 
aaaatgacca caattttaac ttttctattt 1140 
aaaaatcgaa aaaacatttc gattcctgtt 1200 
atcaatgcta acatcgctga gccatccatc 1260 
gttcaaaaag ctgattag 1308 



<210> 9 
<211> 825 
<212> DNA 
<213> C. elegans 

<400> 9 

atggcagcag cacaaacaag tccagcagcc 
agtctggatc agactgattc ttacatgtct 
attggttatc tcgtcaccat ctacttcggg 
gatctccaaa atacacttgc tctctggaac 
gcctataagc ttattccaga actattcgga 
tactgtcaaa acgagaacta ctacaccgat 
gtgatgtcga aagctccaga actaggggat 
gttatcttca tgcactggta tcatcatgcc 
tctgagcatc aggcatgggc tcgttggtct 
atgtatttct acttcgccgt tcgcgccttg 
ttcatcacta ctattcaaat tgtccaattt 
gtattcatta agtctgctga ttctgttcct 
atcggaggac tcatgtacat cagttatttg 
tacattcaaa aacgctcacc aaccaaaacc 



acgctcgtcg atgttttgac aaaaccatgg 60 
acatttgtac cattatccta taaaatcatg 120 
caaaaattaa tggctcacag aaaaccattc 180 
ttcgggtttt cactgttctc gggaatcgcc 240 
gttttcatga aggacgggtt tgtcgcttcc 300 
gcatcaactg gattctgggg ctgggccttt 360 
actatgttct tggtccttcg taaaaaacca 420 
ctcacatttg tctacgcagt agtcacatac 480 
ttggctctca accttgccgt ccacactgtt 540 
aacatccaaa ctccacgccc agtggcaaag 600 
gtcatctcat gctacatttt tgggcatttg 660 
ggttgcgctg ttagctggaa tgtgctatcg 720 
ttcctttttg ccaagttctt ctacaaggcc 780 
agcaagcagg agtag 825 



<210> 10 
<211> 861 
<212> DNA 
<213> C. elegans 

<400> 10 

atgtcatcgg acgatcgtgg cactagaacc 
aacttcactt atgaaggtgc caaagaagtt 
cttgccgtcg gatatattgc cactattttt 
gccttcgatc tcagtactcc attaaacatt 
ttgggattct tattcacttt tcctactttg 
cacacctatt cccatgtctc tgagctttac 
ctttgggtta tctcaaagat tccggaactt 
agaccactta ttttcatgca ctggtaccat 
tgctaccatg aggatgctgt ccatatggtt 
gcattcatgt atggatacta tcttctgaaa 
gctcaagcaa tcaccacatc tcaaatggtt 
catgtttcct ataaacacta tgttgaggga 
acagctatcg gatttttcat gcttactacc 
gagcactatc ttaagaatgg aggcaaaaag 
caaacaaaga aggctaacta a 



ttcaagatga tggatcaaat tcttggaaca 60 
gctcgaggcc ttgaaggttt ctcagcaaag 120 
ggactgaaat attatatgaa agaccgaaaa 180 
tggaatggta ttctttcgac attcagctta 240 
ttatcagtta tcagaaagga tggatttagt 300 
actgacagta cctctggata ttggatcttc 360 
ttggatacag tattcattgt tcttcgcaag 420 
cacgcattga ccggttacta tgctcttgtc 480 
tgggttgtat ggatgaatta tattattcat 540 
tctctgaaag ttccaattcc accatcagtt 600 
caattcgcag ttgccatttt cgcacaagtt 660 
gttgaaggat tagcctactc gttcagagga 720 
tacttctatc tatggattca attctacaaa 780 
tacaatttgg caaaggatca ggcaaaaact 840 

861 



<210> 11 
<211> 825 
<212> DNA 
<213> C. elegans 

<400> 11 _^ 
atgccacagg gagaagtctc attctttgag gtgctgacaa ctgctccatt cagtcatgag 60 
ctctcaaaaa agcatattgc acagactcag tatgctgctt tctggatctc aatggcatat 120 
gttgtcgtta tttttgggct caaggctgtc atgacaaacc gaaaaccatt tgatctcacg 180 
ggaccactga atctctggaa tgcgggtctt gctattttct caactctcgg atcacttgcc 240 
actacatttg gacttctcca cgagttcttc agccgtggat ttttcgaatc ttacattcac 300 
atcggagact tttataatgg actttctgga atgttcacat ggcttttcgt tctctcaaaa 360 
gttgctgaat tcggagatac actttttatt attcttcgta aaaagccatt gatgttcctt 420 
cattggtatc atcatgtgct tacaatgaat tatgctttta tgtcatttga agctaatttg 480 
ggatttaata cttggattac atggatgaat ttctcagttc actcaattat gtatggatat 540 
tatatgcttc gttcttttgg tgtcaaggtt ccagcatgga ttgccaagaa tattacaaca 600 
atgcaaattc ttcaattcgt tattactcat ttcattcttt tccacgttgg atatttggca 660 
gttactggac aatctgttga ctcaactcca ggatattatt ggttctgcct tctcatggaa 720 
atctcttatg tcgttctgtt cggaaacttc tactatcaat catacatcaa gggaggtggc 780 
aagaagttta atgcagagaa gaagactgaa aagaaaattg aataa 825 



<210> 12 
<211> 846 
<212> DNA 
J <213> C. elegans 



<400> 12 

atgtatttga attatttcgc gacggaaatc ttccatcgta gtgcggtttg tgaaacagaa 60 
gcttgtcgct cgtcaaaaat aatgattgct gacgtgttca aatggaaatt cgatgcaaac 120 
gaattgtgga gtcttttaac gaatcaggat gaagttttcc cgcatattag agcacggcga 180 
ttcattcaag aacattttgg tctattcgtc cagatggcaa ttgcatatgt cattttggtg 240 
ttctcaatca aaaggttcat gagggatcgt gaaccatttc aactcaccac agctcttcgt 300 
S ctctggaact tcttcctctc cgtcttctca atttatggtt cctggacaat gtttccattt 360 

H atggttcaac aaataagact ttatggtctc tacggatgtg gatgcgaagc actttcaaac 420 

^ ■ cttccgagtc aagcagaata ttggcttttc ctgacgatct tgtccaaagc tgtggagttt 480 
gttgatacat ttttcttggt tctccggaaa aaaccactca tcttcctaca ctggtatcat 540 
catatggcaa catttgtctt cttctgcagt aattacccga ctccatcgtc acaatcacgc 600 
gtcggagtta tcgtcaacct gttcgtgcat gccttcatgt acccatacta tttcacccga 660 
tcaatgaaca tcaaagttcc tgcgaaaatt tcaatggctg ttacagttct tcaattgact 720 
caattcatgt gctttatcta tggatgtact ctcatgtact actcgttggc cactaatcag 7 80 
gcacgatacc cctcaaatac acctgcgaca ctccaatgtt tgtcctacac tctacatttg 840 

846 

ctttga 



<210> 13 
<211> 866 
<212> DNA 
<213> C. elegans 

<400> 13 

atggctcagc atccgctcgt tcaacggctt ctcgatgtca aattcgacac gaaacgattt 
gtggctattg ctactcatgg gccaaagaat ttccctgacg cagaaggtcg caagttcttt 



4 



gctgatcact ttgatgttac tattcaggct 
acaaaatggt tcatgcgtaa tcgtcaacca 
aatttcatcc tcgccgcatt ttccatcgca 
ggaaccattg ccaacaaagg aattgtcgat 
gagagaatgg atactgggtg tggctcttca 
ccatcttctt ggttctccgt aaacgtccac 
tcaccatgat ctacgcctgg tactctcatc 
tttatcttaa ctttgtcgtc cacgccttca 
agattcgcgt gccaggattc atcgcccaag 
tcatctcttg cgccgttctt gctcatcttg 
gtgatttcga gccatcagta ttcaagctcg 
ttttcgtcaa cttcttcctc caatcatatg 
cagtgccaaa gaagaagaac aactaa 



tcaatcctgt acatggtcgt tgtgttcgga 180 
ttccaattga ctattccact caacatctgg 240 
ggagctgtca aaatgacccc agagttcttt 300 
cctactgcaa agtgtttgat ttcacgaaag 360 
tggcttccaa acttttcgaa cttgttgaca 420 
tcatgttcct tcactggtat caccatattc 480 
cattgacccc aggattcaac agatacggaa 540 
tgtactctta ctacttcctt cgctcgatga 600 
ctatcacatc tcttcaaatc gttcaattca 660 
gttatctcat gcacttcacc aatgccaact 720 
cagttttcat ggacacaaca tacttggctc 780 
ttctccgcgg aggaaaagac aagtacaagg 840 

866 



<210> 14 
<211> 801 
<212> DNA 
<213> C* elegans 

<400> 14 

atgtcggccg aagtgtccga acgattcaaa gtttggacag gaaacaatga gaccatcatc 60 
tattccccat tcgagtacga ttccacgttg ctcatcgagt catgtcggtg tacttatcag 120 
ctgcttatat tattgcgaca aatttattac agagatatat ggagtcacgg aaacctaaaa 180 
cttttactag catggaacgg ttttttggca gtgttcagta ttatgggtac atggagattt 240 
ffl ggaatcgaat tctacgatgc tgttttcaga agaggcttca tcgattcgat ctgcctggct 300 

gtaaatccac gttcaccgtc cgcattctgg gcatgcatgt tcgctctatc gaaaatcgcc 360 
gagtttgggg acacgatgtt cttggtgctg aggaaacggc cggttatatt ccttcactgg 420 
tatcatcacg ctgttgttct gatcctttct tggcatgctg caatcgaact cacagctcca 480 
ggacgctggt ttatttttat gaactatttg gtgcattcaa taatgtatac atactacgca 540 
ataacatcaa tcggctatcg tcttcccaaa atcgtttcaa tgactgttac attccttcaa 600 
/ actcttcaaa tgctcattg^ tgtcagcatt tcttgcattg tgctttattt gaagcttaat 6 60 
. ggagagatgt gccaacaatc ctacgacaat ctggcgttga gcttcggaat ctacgcctca 720 
ttcctggtgc tattctccag tttcttcaac aatgcatatt tggtaaaaaa ggacaagaaa 780 
^ cccgatgtga agaaggatta a 801 

<210> 15 
<211> 291 
<212> PRT 
<213> C. elegans 

<400> 15 

Met Glu Leu Ala Glu Phe Trp Asn Asp Leu Asn Thr Phe Thr lie Tyr 
1 5 10 15 

Gly Pro Asn His Thr Asp Met Thr Thr Lys Tyr Lys Tyr Ser Tyr His 
20 25 30 

Phe Pro Gly Glu Gin Val Ala Asp Pro Gin Tyr Trp Thr lie Leu Phe 
35 40 45 

Gin Lys Tyr Trp Tyr His Ser lie Thr lie Ser Val Leu Tyr Phe lie 
50 55 60 



5 



Leu He Lys Val He Gin Lys Phe Met Glu Asn Arg Lys Pro Phe Thr 
65 70 75 80 

Leu Lys Tyr Pro Leu He Leu Trp Asn Gly Ala Leu Ala Ala Phe Ser 
85 90 95 

lie He Ala Thr Leu Arg Phe Ser He Asp Pro Leu Arg Ser Leu Tyr 
100 105 110 

Ala Glu Gly Phe Tyr Lys Thr Leu Cys Tyr Ser Cys Asn Pro Thr Asp 
115 120 125 

Val Ala Ala Phe Trp Ser Phe Ala Phe Ala Leu Ser Lys He Val Glu 
130 135 140 

Leu Gly Asp Thr Met Phe He He Leu Arg Lys Arg Pro Leu He Phe 
145 ^ 150 155 160 

Leu His Tyr Tyr His His Ala Ala Val Leu He Tyr Thr Val His Ser 
165 170 175 

Gly Ala Glu His Thr Ala Ala Gly Arg Phe Tyr He Leu Met Asn Tyr 
180 185 190 

Phe Ala His Ser Leu Met Tyr Thr Tyr Tyr Thr Val Ser Ala Met Gly 
195 200 205 

Tyr Arg Leu Pro Lys Trp Val Ser Met Thr Val Thr Thr Val Gin Thr 
210 215 220 

Thr Gin Met Leu Ala Gly Val Gly He Thr Trp Met Val Tyr Lys Val 
225 230 235 240 

Lys Thr Glu Tyr Lys Leu Pro Cys Gin Gin Ser Val Ala Asn Leu Tyr 
245 250 255 

Leu Ala Phe Val He Tyr Val Thr Phe Ala He Leu Phe He Gin Phe 
260 265 270 

Phe Val Lys Ala Tyr He He Lys Ser Ser Lys Lys Ser Lys Ser Val 
275 280 285 

Lys Asn Glu 
290 



<210> 16 
<211> 435 
<212> PRT 
<213> C, elegans 



<400> 16 

Met Ala Lys Tyr Asp Tyr Asn Pro Lys Tyr Gly Leu Glu Asn Tyr Ser 
15 10 15 



V 



He Phe Leu Pro Phe Glu Thr Ser Phe Asp Ala Phe Arg Ser Thr Thr 
20 25 30 

Trp Met Gin Asn His Trp Tyr Gin Ser He Thr Ala Ser Val Val Tyr 
35 40 45 

Val Ala Val He Phe Thr Gly Lys Lys Val Val Leu He Tyr Lys Lys 
50 55 60 

Ser Arg Val He Thr Phe Glu Ser Ser Leu Gin Asn Ala He Lys Asn 
65 70 75 80 

Arg Asn Arg Lys Ser Leu Asn Ser Ser Gin Met Phe Gin He Met Glu 
85 90 95 

Lys Tyr Lys Pro Phe Gin Leu Asp Thr Pro Leu Phe Val Trp Asn Ser 
100 105 110 

Phe Leu Ala He Phe Ser He Leu Gly Phe Leu Arg Met Thr Pro Glu 
115 120 125 

=1 Phe Val Trp Ser Trp Ser Ala Glu Gly Asn Ser Phe Lys Tyr Ser He 

^ 130 135 140 

s „ r 

S Cys His Ser Ser Tyr Ala Gin Gly Val Thr Gly Phe Trp Thr Glu Gin 

% 145 150 155 160 

■"^ Phe Ala Met Ser Lys Leu Phe Glu Leu He Asp Thr He Phe He Val 

'/I 165 170 175 



Leu Arg Lys Arg Pro Leu He Phe Leu His Trp Tyr His His Val Thr 
180 185 190 

Val Met He Tyr Thr Trp His Ala Tyr Lys Asp His Thr Ala Ser Gly 
195 200 205 

Arg Trp Phe He Trp Met Asn Tyr Gly Val His Ala Leu Met Tyr Ser 
210 215 220 

Tyr Tyr Ala Leu Arg Ser Leu Lys Phe Arg Leu Pro Lys Gin Met Ala 
225 230 235 240 

Met Val val Thr Thr Leu Gin Leu Ala Gin Met Val Met Gly Val He 
245 250 255 

He Gly Val Thr Val Tyr Arg He Lys Ser Ser Gly Glu Tyr Cys Gin 
260 265 270 

Gin Thr Trp Asp Asn Leu Gly Leu Cys Phe Gly Val Tyr Phe Thr Tyr 
275 280 285 

Phe Leu Leu Phe Ala Asn Phe Phe Tyr His Ala Tyr Val Lys Lys Asn 
290 295 300 



7 



Asn Arg Thr Val Asn Tyr Glu Asn Asn Ser Lys Asn Phe Pro Asp Leu 
305 310 315 320 

Val Leu He Tyr Leu Arg Lys Lys Val Ser Arg Lys Ser Lys Asn Arg 
325 330 335 

Gin Cys Ser Glu Asn Asn Tyr Lys He Gin Phe Ser Ser Asn Phe Val 
340 345 350 

Asn Val Asp Gly Lys Lys His Lys Lys Thr Tyr Glu Leu He Leu Pro 
355 360 365 

Arg Arg Lys Met Thr Thr He Leu Thr Phe Leu Phe Gly Lys Asn Arg 
370 375 380 

He Phe Ser Lys Tyr Gin Lys Asn Arg Lys Asn He Ser He Pro Val 
385 390 395 400 

Asp Phe Glu He Leu Glu Pro Lys Glu Asp He Asn Ala Asn He Ala 
405 410 415 

Glu Pro Ser He Thr Thr Arg Ser Ala Ala Ala Arg Arg Lys Val Gin 
420 425 430 

Lys Ala Asp 
435 



<210> 17 
<211> 274 
<212> PRT 
<213> C. elegans 

<400> 17 

Met Ala Ala Ala Gin Thr Ser Pro Ala Ala Thr Leu Val Asp Val Leu 
15 10 15 

Thr Lys Pro Trp Ser Leu Asp Gin Thr Asp Ser Tyr Met Ser Thr Phe 
20 25 30 

Val Pro Leu Ser Tyr Lys He Met He Gly Tyr Leu Val Thr He Tyr 
35 40 45 

Phe Gly Gin Lys Leu Met Ala His Arg Lys Pro Phe Asp Leu Gin Asn 
50 55 60 

Thr Leu Ala Leu Trp Asn Phe Gly Phe Ser Leu Phe Ser Gly He Ala 
65 70 75 80 

Ala Tyr Lys Leu He Pro Glu Leu Phe Gly Val Phe Met Lys Asp Gly 
85 90 95 

Phe Val Ala Ser Tyr Cys Gin Asn Glu Asn Tyr Tyr Thr Asp Ala Ser 
100 105 110 



8 



Thr Gly Phe Trp Gly Trp Ala Phe Val Met Ser Lys Ala Pro Glu Leu 
115 120 125 



Gly Asp Thr Met Phe Leu Val Leu Arg Lys Lys Pro Val He Phe Met 
130 135 140 

His Trp Tyr His His Ala Leu Thr Phe Val Tyr Ala Val Val Thr Tyr 
145 150 155 160 

Ser Glu His Gin Ala Trp Ala Arg Trp Ser Leu Ala Leu Asn Leu Ala 
165 170 175 

Val His Thr Val Met Tyr Phe Tyr Phe Ala Val Arg Ala Leu Asn He 
180 185 190 

Gin Thr Pro Arg Pro Val Ala Lys Phe He Thr Thr He Gin He Val 
195 200 205 

Gin Phe Val He Ser Cys Tyr He Phe Gly His Leu Val Phe He Lys 
210 215 220 

Ser Ala Asp Ser Val Pro Gly Cys Ala Val Ser Trp Asn Val Leu Ser 
225 230 235 240 

He Gly Gly Leu Met Tyr He Ser Tyr Leu Phe Leu Phe Ala Lys Phe 
245 250 255 

Phe Tyr Lys Ala Tyr He Gin Lys Arg Ser Pro Thr Lys Thr Ser Lys 
260 265 270 



Gin Glu 



<2i0> 18 
<211> 286 
<212> PRT 
<213> C. elegans 

<400> 18 

Met Ser Ser Asp Asp Arg Gly Thr Arg Thr Phe Lys Met Met Asp Gin 
15 10 15 

He Leu Gly Thr Asn Phe Thr Tyr Glu Gly Ala Lys Glu Val Ala Arg 
20 25 30 

Gly Leu Glu Gly Phe Ser Ala Lys Leu Ala Val Gly Tyr He Ala Thr 
35 40 45 

He Phe Gly Leu Lys Tyr Tyr Met Lys Asp Arg Lys Ala Phe Asp Leu 
50 55 60 

Ser Thr Pro Leu Asn lie Trp Asn Gly He Leu Ser Thr Phe Ser Leu 



65 



70 75 80 



Leu Gly Phe Leu Phe Thr Phe Pro Thr Leu Leu Ser Val He Arg Lys 
85 90 95 

Asp Gly Phe Ser His Thr Tyr Ser His Val Ser Glu Leu Tyr Thr Asp 
100 105 110 

Ser Thr Ser Gly Tyr Trp He Phe Leu Trp Val He Ser Lys He Pro 
115 120 125 

Glu Leu Leu Asp Thr Val Phe He Val Leu Arg Lys Arg Pro Leu He 
130 135 140 

Phe Met His Trp Tyr His His Ala Leu Thr Gly Tyr Tyr Ala Leu Val 
145 150 155 160 

Cys Tyr His Glu Asp Ala Val His Met Val Trp Val Val Trp Met Asn 
165 170 175 

Tyr He He His Ala Phe Met Tyr Gly Tyr Tyr Leu Leu Lys Ser Leu 
180 185 190 

Lys val Pro He Pro Pro Ser Val Ala Gin Ala He Thr Thr Ser Gin 
195 200 205 

Met val Gin Phe Ala Val Ala He Phe Ala Gin Val His Val Ser Tyr 
210 215 220 

Lys His Tyr Val Glu Gly Val Glu Gly Leu Ala Tyr Ser Phe Arg Gly 
225 230 235 240 

Thr Ala He Gly Phe Phe Met Leu Thr Thr Tyr Phe Tyr Leu Trp He 
245 250 255 

Gin Phe Tyr Lys Glu His Tyr Leu Lys Asn Gly Gly Lys Lys Tyr Asn 
260 265 270 

Leu Ala Lys Asp Gin Ala Lys Thr Gin Thr Lys Lys Ala Asn 
275 280 285 



<210> 19 
<211> 274 
<212> PRT 
<213> C. elegans 

<400> 19 

Met Pro Gin Gly Glu Val Ser Phe 
1 5 

Phe Ser His Glu Leu Ser Lys Lys 
20 



Phe Glu Val Leu Thr Thr Ala Pro 
10 15 

His He Ala Gin Thr Gin Tyr Ala 
25 30 



10 



Ala Phe Trp He Ser Met Ala Tyr Val Val Val He Phe Gly Leu Lys 
35 40 45 



Ala Val Met Thr Asn Arg Lys Pro Phe Asp Leu Thr Gly Pro Leu Asn 
50 55 60 

Leu Trp Asn Ala Gly Leu Ala He Phe Ser Thr Leu Gly Ser Leu Ala 
65 70 75 80 

Thr Thr Phe Gly Leu Leu His Glu Phe Phe Ser Arg Gly Phe Phe Glu 
85 90 95 

Ser Tyr He His He Gly Asp Phe Tyr Asn Gly Leu Ser Gly Met Phe 
100 105 110 

Thr Trp Leu Phe Val Leu Ser Lys Val Ala Glu Phe Gly Asp Thr Leu 
115 120 125 

Phe He He Leu Arg Lys Lys Pro Leu Met Phe Leu His Trp Tyr His 
130 135 140 

His Val Leu Thr Met Asn Tyr Ala Phe Met Ser Phe Glu Ala Asn Leu 
145 150 155 160 

Gly Phe Asn Thr Trp He Thr Trp Met Asn Phe Ser Val His Ser He 
165 170 175 

Met Tyr Gly Tyr Tyr Met Leu Arg Ser Phe Gly Val Lys Val Pro Ala 
180 185 190 

Trp He Ala Lys Asn He Thr Thr Met Gin He Leu Gin Phe Val He 
195 200 205 

Thr His Phe He Leu Phe His Val Gly Tyr Leu Ala Val Thr Gly Gin 
210 215 220 

Ser Val Asp Ser Thr Pro Gly Tyr Tyr Trp Phe Cys Leu Leu Met Glu 
225 230 235 240 

He Ser Tyr Val Val Leu Phe Gly Asn Phe Tyr Tyr Gin Ser Tyr He 
245 250 255 

Lys Gly Gly Gly Lys Lys Phe Asn Ala Glu Lys Lys Thr Glu Lys Lys 
260 265 270 



He Glu 



<210> 20 
<211> 281 
<212> PRT 
<213> C. elegans 



11 



<400> 20 

Met Tyr Leu Asn Tyr Phe Ala Thr Glu lie Phe His Arg Ser Ala Val 
15 10 15 

Cys Glu Thr Glu Ala Cys Arg Ser Ser Lys lie Met lie Ala Asp Val 
20 25 30 

Phe Lys Trp Lys Phe Asp Ala Asn Glu Leu Trp Ser Leu Leu Thr Asn 
35 40 45 

Gin Asp Glu Val Phe Pro His lie Arg Ala Arg Arg Phe lie Gin Glu 
50 55 60 

His Phe Gly Leu Phe Val Gin Met Ala He Ala Tyr Val He Leu Val 
65 70 75 80 

Phe Ser He Lys Arg Phe Met Arg Asp Arg Glu Pro Phe Gin Leu Thr 
85 90 95 

Thr Ala Leu Arg Leu Trp Asn Phe Phe Leu Ser Val Phe Ser He Tyr 
100 105 110 

Gly Ser Trp Thr Met Phe Pro Phe Met Val Gin Gin He Arg Leu Tyr 
115 120 125 

Gly Leu Tyr Gly Cys Gly Cys Glu Ala Leu Ser Asn Leu Pro Ser Gin 
130 135 140 

Ala Glu Tyr Trp Leu Phe Leu Thr He Leu Ser Lys Ala Val Glu Phe 
145 150 155 160 

Val Asp Thr Phe Phe Leu Val Leu Arg Lys Lys Pro Leu He Phe Leu 
165 170 175 

His Trp Tyr His His Met Ala Thr Phe Val Phe Phe Cys Ser Asn Tyr 
180 185 190 

Pro Thr Pro Ser Ser Gin Ser Arg Val Gly Val He Val Asn Leu Phe 
195 200 205 

Val His Ala Phe Met Tyr Pro Tyr Tyr Phe Thr Arg Ser Met Asn He 
210 215 220 

Lys Val Pro Ala Lys He Ser Met Ala Val Thr Val Leu Gin Leu Thr 
225 230 235 240 

Gin Phe Met Cys Phe He Tyr Gly Cys Thr Leu Met Tyr Tyr Ser Leu 
245 250 255 

Ala Thr Asn Gin Ala Arg Tyr Pro Ser Asn Thr Pro Ala Thr Leu Gin 
260 265 270 



Cys Leu Ser Tyr Thr Leu His Leu Leu 
275 280 



12 



<210> 21 
<211> 288 
<212> PRT 
<213> C. elegans 

<400> 21 

Met Ala Gin His Pro Leu Val Gin Arg Leu Leu Asp Val Lys Phe Asp 
15 10 15 

Thr Lys Arg Phe Val Ala He Ala Thr His Gly Pro Lys Asn Phe Pro 
20 25 30 

Asp Ala Glu Gly Arg Lys Phe Phe Ala Asp His Phe Asp Val Thr He 
35 40 45 

Gin Ala Ser He Leu Tyr Met Val Val Val Phe Gly Thr Lys Trp Phe 
50 55 60 

Met Arg Asn Arg Gin Pro Phe Gin Leu Thr He Pro Leu Asn He Trp 
65 70 75 80 

Asn Phe He Leu Ala Ala Phe Ser He Ala Gly Ala Val Lys Met Thr 
85 90 95 

Pro Glu Phe Phe Gly Thr He Ala Asn Lys Gly He Val Ala Ser Tyr 
100 105 110 

Cys Lys Val Phe Asp Phe Thr Lys Gly Glu Asn Gly Tyr Trp Val Trp 
115 120 125 

Leu Phe Met Ala Ser Lys Leu Phe Glu Leu Val Asp Thr He Phe Leu 
130 135 140 

Val Leu Arg Lys Arg Pro Leu Met Phe Leu His Trp Tyr His His He 
145 150 155 160 

Leu Thr Met He Tyr Ala Trp Tyr Ser His Pro Leu Thr Pro Gly Phe 
165 170 175 

Asn Arg Tyr Gly He Tyr Leu Asn Phe Val Val His Ala Phe Met Tyr 
180 185 190 

Ser Tyr Tyr Phe Leu Arg Ser Met Lys He Arg Val Pro Gly Phe He 
195 200 205 

Ala Gin Ala He Thr Ser Leu Gin He Val Gin Phe He He Ser Cys 
210 215 220 

Ala Val Leu Ala His Leu Gly Tyr Leu Met His Phe Thr Asn Ala Asn 
225 230 235 240 

Cys Asp Phe Glu Pro Ser Val Phe Lys Leu Ala Val Phe Met Asp Thr 
245 250 255 



13 



Thr Tyr Leu Ala Leu Phe Val Asn 
260 

Arg Gly Gly Lys Asp Lys Tyr Lys 
275 280 



Phe Phe Leu Gin Ser Tyr Val Leu 
265 270 

Ala Val Pro Lys Lys Lys Asn Asn 
285 



<210> 22 
<211> 269 
<212> PRT 
<213> C. elegans 

<220> 

<221> VARIANT 

<222> (1) . . . (269) 

<223> Xaa = Any Amino Acid 

<400> 22 

Met Ser Ala Glu Val Ser Glu Arg Phe Lys Val Trp Thr Gly Asn Asn 
15 10 15 

Glu Thr lie lie Tyr Ser Pro Phe Glu Tyr Asp Ser Thr Leu Leu lie 
20 25 30 

Glu Ser Cys Arg Cys Thr Tyr Gin Leu Leu lie Leu Leu Arg Gin lie 
35 40 45 

Tyr Tyr Arg Asp lie Trp Ser His Gly Asn Leu Lys Ala Cys Asp Xaa 
50 55 60 

Leu Leu Leu Ala Trp Asn Gly Phe Leu Ala Val Phe Ser lie Met Gly 
65 70 75 80 

Thr Trp Arg Phe Gly lie Glu Phe Tyr Asp Ala Val Phe Arg Xaa Gly 
85 90 95 

Phe lie Xaa Ser lie Cys Leu Ala Val Asn Pro Arg Ser Pro Ser Ala 
100 105 110 

Phe Trp Ala Cys Met Phe Ala Leu Ser Lys lie Ala Glu Phe Gly Asp 
115 120 125 

Thr Met Phe Leu Val Leu Arg Lys Arg Pro Val lie Phe Leu His Trp 
130 135 140 

Tyr His His Ala Val Val Leu lie Leu Ser Trp His Ala Ala lie Glu 
145 150 155 160 

Leu Thr Ala Pro Gly Arg Trp Phe lie Phe Met Asn Tyr Leu Val His 
165 170 175 



14 



Ser He Met Tyr Thr Tyr Tyr Ala He Thr Ser He Gly Tyr Arg Xaa 
180 185 190 



Pro Lys He Val Ser Met Thr Val 
195 200 

Leu He Gly Val Ser He Ser Cys 
210 215 

Gly Glu Met Cys Gin Gin Ser Tyr 
225 230 

He Tyr Ala Ser Phe Leu Val Leu 
245 

Leu Val Lys Lys Asp Lys Lys Pro 
260 



Thr Phe Leu Gin Thr Leu Gin Met 
205 

He Val Leu Tyr Leu Lys Leu Asn 
220 

Asp Asn Leu Ala Leu Ser Phe Gly 
235 240 

Ser Ser Phe Phe Asn Asn Ala Tyr 
250 255 

Asp Val Lys Lys Asp 
265 



^^k5 ■ 
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. V ■ '. 09/936845 

JC16 Rec'd PCT/PTO SEP 1 8 2001 

SEQUENCE LISTING 

<110> Napier, Johnathan A. 

<120> Polyunsaturated Fatty Acid (PUPA) Elongase from Caenorhabditis elegans 
<130> 76/7 

<140> PCT/GBOO/01035 
<141> 2000-03-20 

<160> 22 

<170> Patentin Ver. 2.1 

<210> 1 

<211> 27 

<212> DNA 

<213> C. elegans 

\ <400> 1 

S gcgggtacca tggctcagca tccgctc 27 

W . <210> 2 

P : .. . <211> 27 

SH <212> DNA 

jC <213> C. elegans 

J' <400> 2 

gcgggatcct tagttgttct tcttctt 27 



5^ <210> 3 

W. .: ... .. .<211> 27 

O <212> DNA 

<213> C. elegans 

<400> 3 

gcgggtacca tgccacaggg agaagtc 27 



<210> 4 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 4 

gcgggatcct tattcaattt ttctttt 27 



<210> 5 

<211> 29 

<212> DNA 

<213> C. elegans 



1 



<400> 5 

gcgaattcac catgggtacg gaccaagga 



29 



<210> 6 

<211> 27 

<212> DNA 

<213> C. elegans 

<400> 6 

gcggagctcc tactcttcct tgggacg 27 



<210> 7 
<211> 876 
<212> DNA 
<213> C. elegans 

<400> 7 

atggagcttg ccgagttctg gaatgatctc 
acagatatga ccacaaaata caaatattca 
ccgcagtatt ggacgatttt attccagaaa 
ctttatttca ttttaattaa ggtgattcaa 
ttgaaatacc cattgattct ttggaatgga 
ttgcggttct ctattgatcc tctacgatca 
tgctattcgt gtaatccaac tgatgtggct 
aagattgttg aacttggaga cactatgttc 
ttacactact atcatcatgc agcagtgtta 
actgcagctg gtcgtttcta catcctaatg 
tactacacag tttctgccat gggatacaga 
actgttcaaa caactcaaat gttagctgga 
aaaactgaat acaagcttcc ttgtcaacaa 
atctatgtca catttgccat tcttttcatt 
tcgtcgaaga agtcgaaatc ggtgaagaac 



aacaccttca ccatctacgg accgaatcac 60 
tatcacttcc caggtgaaca ggtggcggat 120 
tattggtatc attcgatcac aatatcagtt 180 
aagtttatgg agaatcgaaa accattcact 240 
gctcttgcag cattcagtat aattgccaca 300 
ctatatgctg aaggattcta caaaactctg 360 
gcattttgga gctttgcatt cgctctttcc 420 
attattttga gaaaacggcc attgatcttt 48 0 
atctacactg tccattctgg tgccgagcat 540 
aactacttcg cacattctct catgtatact 600 
ttaccgaaat gggtatcaat gactgtcaca 660 
gtcggaataa cttggatggt gtacaaagtg 720 
tccgtagcca atttgtatct cgcattcgtc 780 
caattcttcg tcaaggcata cattatcaag 840 
gaataa 876 



<210> 8 
<211> 1308 
<212> DNA 
<213> C. elegans 

<400> 8 

atggcaaaat acgactacaa tccgaagtat 
tttgagacat cttttgatgc atttcgatcg 
tcaattacag catctgtcgt gtatgtagcc 
atctacaaaa aatcacgagt tattactttt 
cgaaaccgaa aatcacttaa tagttctcaa 
ttccaactgg acacaccact cttcgtctgg 
gggttcctcc gaatgacacc tgaatttgta 
aaatattcaa tttgtcattc atcttatgct 
ttcgcaatga gcaaactttt cgagctcatc 
ccactcatct tccttcactg gtatcatcat 
tacaaggatc acactgcatc aggacggtgg 
cttatgtatt cctactatgc tcttcgttct 



gggttagaaa attacagcat attccttccc 60 
acaacatgga tgcaaaatca ctggtatcaa 120 
gtcattttta caggaaagaa ggtggttctc 180 
gagtctagcc ttcagaatgc aattaagaat 240 
atgtttcaga ttatggaaaa gtacaagccc 300 
aattcatttt tagccatttt ctcaattctc 3 60 
tggagttggt cagcagaagg aaactcattc 420 
caaggagtca ctggtttctg gactgaacaa 480 
gacacaatct tcatcgttct tcgtaaacgt 540 
gtaactgtta tgatctacac atggcacgcg 600 
ttcatttgga tgaattatgg agttcatgct 660 
ctgaaattcc gtcttccaaa acaaatggca 720 



atggttgtta ctactctcca acttgctcaa 
gtctaccgta tcaagtcatc gggtgaatac 
tgctttggag tttatttcac atatttcctt 
gttaagaaaa acaaccgtac agtaaattat 
gttttaattt acctgagaaa aaaggtttca 
aataattata aaattcaatt ttcatcaaat 
aaaacatatg aacttattct tccaagaaga 
ggaaaaaatc gaattttttc gaaatatcag 
gatttcgaaa ttctggagcc aaaagaagat 
acaacgaggt ccgccgccgc acgaagaaaa 



atggttatgg gagtaatcat cggagtcact 780 
tgccaacaga catgggacaa tttgggatta 840 
cttttcgcca acttcttcta ccatgcatat 900 
gaaaataatt caaaaaattt ccccgatctc 960 
agaaaatcga aaaatcggca atgttcagaa 1020 
tttgttaatg ttgatggaaa aaaacataag 1080 
aaaatgacca caattttaac ttttctattt 1140 
aaaaatcgaa aaaacatttc gattcctgtt 1200 
atcaatgcta acatcgctga gccatccatc 1260 
gttcaaaaag ctgattag 1308 



<210> 9 
<211> 825 
<212> DNA 
<213> C. elegans 

<400> 9 

atggcagcag cacaaacaag tccagcagcc 
agtctggatc agactgattc ttacatgtct 
attggttatc tcgtcaccat ctacttcggg 
gatctccaaa atacacttgc tctctggaac 
gcctataagc ttattccaga actattcgga 
tactgtcaaa acgagaacta ctacaccgat 
gtgatgtcga aagctccaga actaggggat 
gttatcttca tgcactggta tcatcatgcc 
tctgagcatc aggcatgggc tcgttggtct 
atgtatttct acttcgccgt tcgcgccttg 
ttcatcacta ctattcaaat tgtccaattt 
gtattcatta agtctgctga ttctgttcct 
atcggaggac tcatgtacat cagttatttg 
tacattcaaa aacgctcacc aaccaaaacc 



acgctcgtcg atgttttgac aaaaccatgg 60 
acatttgtac cattatccta taaaatcatg 120 
caaaaattaa tggctcacag aaaaccattc 180 
ttcgggtttt cactgttctc gggaatcgcc 240 
gttttcatga aggacgggtt tgtcgcttcc 300 
gcatcaactg gattctgggg ctgggccttt 360 
actatgttct tggtccttcg taaaaaacca 420 
ctcacatttg tctacgcagt agtcacatac 480 
ttggctctca accttgccgt ccacactgtt 540 
aacatccaaa ctccacgccc agtggcaaag 600 
gtcatctcat gctacatttt tgggcatttg 660 
ggttgcgctg ttagctggaa tgtgctatcg 720 
ttcctttttg ccaagttctt ctacaaggcc 780 
agcaagcagg agtag 825 



<210> 10 
<211> 861 
<212> DNA 
<213> C. elegans 

<400> 10 

atgtcatcgg acgatcgtgg cactagaacc 
aacttcactt atgaaggtgc caaagaagtt 
cttgccgtcg gatatattgc cactattttt 
gccttcgatc tcagtactcc attaaacatt 
ttgggattct tattcacttt tcctactttg 
cacacctatt cccatgtctc tgagctttac 
ctttgggtta tctcaaagat tccggaactt 
agaccactta ttttcatgca ctggtaccat 
tgctaccatg aggatgctgt ccatatggtt 
gcattcatgt atggatacta tcttctgaaa 
gctcaagcaa tcaccacatc tcaaatggtt 
catgtttcct ataaacacta tgttgaggga 
acagctatcg gatttttcat gcttactacc 
gagcactatc ttaagaatgg aggcaaaaag 
caaacaaaga aggctaacta a 



ttcaagatga tggatcaaat tcttggaaca 60 
gctcgaggcc ttgaaggttt ctcagcaaag 120 
ggactgaaat attatatgaa agaccgaaaa 180 
tggaatggta ttctttcgac attcagctta 240 
ttatcagtta tcagaaagga tggatttagt 300 
actgacagta cctctggata ttggatcttc 360 
ttggatacag tattcattgt tcttcgcaag 420 
cacgcattga ccggttacta tgctcttgtc 480 
tgggttgtat ggatgaatta tattattcat 540 
tctctgaaag ttccaattcc accatcagtt 600 
caattcgcag ttgccatttt cgcacaagtt 660 
gttgaaggat tagcctactc gttcagagga 720 
tact tct ate tatggattca attctacaaa 780 
tacaatttgg caaaggatca ggcaaaaact 840 

861 



<210> 11 
<211> 825 
<212> DNA 
<213> C. elegans 

<400> 11 

atgccacagg gagaagtctc attctttgag 
ctctcaaaaa agcatattgc acagactcag 
gttgtcgtta tttttgggct caaggctgtc 
ggaccactga atctctggaa tgcgggtctt 
actacatttg gacttctcca cgagttcttc 
atcggagact tttataatgg actttctgga 
gttgctgaat tcggagatac actttttatt 
cattggtatc atcatgtgct tacaatgaat 
ggatttaata cttggattac atggatgaat 
tatatgcttc gttcttttgg tgtcaaggtt 
atgcaaattc ttcaattcgt tattactcat 
gttactggac aatctgttga ctcaactcca 
atctcttatg tcgttctgtt cggaaacttc 
aagaagttta atgcagagaa gaagactgaa 



gtgctgacaa ctgctccatt cagtcatgag 60 
tatgctgctt tctggatctc aatggcatat 120 
atgacaaacc gaaaaccatt tgatctcacg 180 
gctattttct caactctcgg atcacttgcc 240 
agccgtggat ttttcgaatc ttacattcac 300 
atgttcacat ggcttttcgt tctctcaaaa 360 
attcttcgta aaaagccatt gatgttcctt 420 
tatgctttta tgtcatttga agctaatttg 480 
ttctcagttc actcaattat gtatggatat 540 
ccagcatgga ttgccaagaa tattacaaca 600 
ttcattcttt tccacgttgg atatttggca 660 
ggatattatt ggttctgcct tctcatggaa 720 
tactatcaat catacatcaa gggaggtggc 780 
aagaaaattg aataa 825 





<210> 12 








<211> 846 








<212> DNA 








<213> C. elegans 






<400> 12 






"iis 


atgtatttga 


attatttcgc 


gacggaaatc 




gcttgtcgct 


cgtcaaaaat 


aatgattgct 


iH 


gaattgtgga 


gtcttttaac 


gaatcaggat 




ttcattcaag 


aacattttgg 


tctattcgtc 


m ■■ . 


ttctcaatca 


aaaggttcat 


gagggatcgt 




ctctggaact 


tcttcctctc 


cgtcttctca 




atggttcaac 


aaataagact 


ttatggtctc 




cttccgagtc 


aagcagaata 


ttggcttttc 




gttgatacat 


ttttcttggt 


tctccggaaa 




catatggcaa 


catttgtctt 


cttctgcagt 




gtcggagtta 


tcgtcaacct 


gttcgtgcat 




tcaatgaaca 


tcaaagttcc 


tgcgaaaatt 




caattcatgt 


gctttatcta 


tggatgtact 




gcacgatacc 


cctcaaatac 


acctgcgaca 




ctttga 







ttccatcgta gtgcggtttg tgaaacagaa 60 
gacgtgttca aatggaaatt cgatgcaaac 120 
gaagttttcc cgcatattag agcacggcga 180 
cagatggcaa ttgcatatgt cattttggtg 240 
gaaccatttc aactcaccac agctcttcgt 300 
atttatggtt cctggacaat gtttccattt 360 
tacggatgtg gatgcgaagc actttcaaac 420 
ctgacgatct tgtccaaagc tgtggagttt 480 
aaaccactca tcttcctaca ctggtatcat 540 
aattacccga ctccatcgtc acaatcacgc 600 
gccttcatgt acccatacta tttcacccga 660 
tcaatggctg ttacagttct tcaattgact 720 
ctcatgtact actcgttggc cactaatcag 780 
ctccaatgtt tgtcctacac tctacatttg 840 

846 



<210> 13 
<211> 866 
<212> DNA 
<213> C. elegans 

<400> 13 

atggctcagc atccgctcgt tcaacggctt ctcgatgtca aattcgacac gaaacgattt 60 
gtggctattg ctactcatgg gccaaagaat ttccctgacg cagaaggtcg caagttcttt 120 



4 



gctgatcact ttgatgttac tattcaggct 
acaaaatggt tcatgcgtaa tcgtcaacca 
aatttcatcc tcgccgcatt ttccatcgca 
ggaaccattg ccaacaaagg aattgtcgat 
gagagaatgg atactgggtg tggctcttca 
ccatcttctt ggttctccgt aaacgtccac 
tcaccatgat ctacgcctgg tactctcatc 
tttatcttaa ctttgtcgtc cacgccttca 
agattcgcgt gccaggattc atcgcccaag 
tcatctcttg cgccgttctt gctcatcttg 
gtgatttcga gccatcagta ttcaagctcg 
ttttcgtcaa cttcttcctc caatcatatg 
cagtgccaaa gaagaagaac aactaa 



tcaatcctgt acatggtcgt tgtgttcgga 180 
ttccaattga ctattccact caacatctgg 240 
ggagctgtca aaatgacccc agagttcttt 300 
cctactgcaa agtgtttgat ttcacgaaag 360 
tggcttccaa acttttcgaa cttgttgaca 420 
tcatgttcct tcactggtat caccatattc 480 
cattgacccc aggattcaac agatacggaa 540 
tgtactctta ctacttcctt cgctcgatga 600 
ctatcacatc tcttcaaatc gttcaattca 660 
gttatctcat gcacttcacc aatgccaact 720 
cagttttcat ggacacaaca tacttggctc 780 
ttctccgcgg aggaaaagac aagtacaagg 840 

866 



<210> 14 
<211> 801 
<212> DNA 
<213> C, elegans 

<400> 14 

atgtcggccg aagtgtccga acgattcaaa 
tattccccat tcgagtacga ttccacgttg 
ctgcttatat tattgcgaca aatttattac 
cttttactag catggaacgg ttttttggca 
ggaatcgaat tctacgatgc tgttttcaga 
gtaaatccac gttcaccgtc cgcattctgg 
gagtttgggg acacgatgtt cttggtgctg 
tatcatcacg ctgttgttct gatcctttct 
ggacgctggt ttatttttat gaactatttg 
ataacatcaa tcggctatcg tcttcccaaa 
actcttcaaa tgctcattgg tgtcagcatt 
ggagagatgt gccaacaatc ctacgacaat 
ttcctggtgc tattctccag tttcttcaac 
cccgatgtga agaaggatta a 



gtttggacag gaaacaatga gaccatcatc 60 
ctcatcgagt catgtcggtg tact tat cag 120 
agagatatat ggagtcacgg aaacctaaaa 180 
gtgttcagta ttatgggtac atggagattt 240 
agaggcttca tcgattcgat ctgcctggct 300 
gcatgcatgt tcgctctatc gaaaatcgcc 360 
aggaaacggc cggttatatt ccttcactgg 420 
tggcatgctg caatcgaact cacagctcca 480 
gtgcattcaa taatgtatac atactacgca 540 
atcgtttcaa tgactgttac attccttcaa 600 
tcttgcattg tgctttattt gaagcttaat 660 
ctggcgttga gcttcggaat ctacgcctca 720 
aatgcatatt tggtaaaaaa ggacaagaaa 780 

801 



<210> 15 
<211> 291 
<212> PRT 
<213> C. elegans 

<400> 15 

Met Glu Leu Ala Glu Phe Trp Asn Asp Leu Asn Thr Phe Thr He Tyr 
1 5 10 15 

Gly Pro Asn His Thr Asp Met Thr Thr Lys Tyr Lys Tyr Ser Tyr His 
20 25 30 

Phe Pro Gly Glu Gin Val Ala Asp Pro Gin Tyr Trp Thr He Leu Phe 
35 40 45 



Gin Lys Tyr Trp Tyr His Ser He Thr He Ser Val Leu Tyr Phe He 
50 55 60 



Leu lie Lys Val He Gin Lys Phe Met Glu Asn Arg Lys Pro Phe Thr 
65 70 75 80 



Leu Lys Tyr Pro Leu He Leu Trp Asn Gly Ala Leu Ala Ala Phe Ser 
85 90 95 

lie He Ala Thr Leu Arg Phe Ser lie Asp Pro Leu Arg Ser Leu Tyr 
100 105 110 

Ala Glu Gly Phe Tyr Lys Thr Leu Cys Tyr Ser Cys Asn Pro Thr Asp 
115 120 125 

Val Ala Ala Phe Trp Ser Phe Ala Phe Ala Leu Ser Lys He Val Glu 
130 135 140 

Leu Gly Asp Thr Met Phe He He Leu Arg Lys Arg Pro Leu He Phe 
145 150 155 160 

Leu His Tyr Tyr His His Ala Ala Val Leu He Tyr Thr Val His Ser 
165 170 175 

Gly Ala Glu His Thr Ala Ala Gly Arg Phe Tyr He Leu Met Asn Tyr 
180 185 190 

Phe Ala His Ser Leu Met Tyr Thr Tyr Tyr Thr Val Ser Ala Met Gly 
195 200 205 

Tyr Arg Leu Pro Lys Trp Val Ser Met Thr Val Thr Thr Val Gin Thr 
210 215 220 

Thr Gin Met Leu Ala Gly Val Gly He Thr Trp Met Val Tyr Lys Val 
225 230 235 240 

Lys Thr Glu Tyr Lys Leu Pro Cys Gin Gin Ser Val Ala Asn Leu Tyr 
245 250 255 

Leu Ala Phe Val He Tyr Val Thr Phe Ala He Leu Phe He Gin Phe 
260 265 270 

Phe Val Lys Ala Tyr He He Lys Ser Ser Lys Lys Ser Lys Ser Val 
275 280 285 



Lys Asn Glu 
290 



<210> 16 
<211> 435 
<212> PRT 
<213> C, elegans 

<400> 16 

Met Ala Lys Tyr Asp Tyr Asn Pro Lys Tyr Gly Leu Glu Asn Tyr Ser 
15 10 15 



6 



lie Phe Leu Pro Phe Glu Thr Ser Phe Asp Ala Phe Arg Ser Thr Thr 
20 25 30 

Trp Met Gin Asn His Trp Tyr Gin Ser lie Thr Ala Ser Val Val Tyr 
35 40 45 

Val Ala Val lie Phe Thr Gly Lys Lys Val Val Leu lie Tyr Lys Lys 
50 55 60 



Ser Arg Val lie Thr Phe Glu Ser Ser Leu Gin Asn Ala lie Lys Asn 
65 70 75 80 

Arg Asn Arg Lys Ser Leu Asn Ser Ser Gin Met Phe Gin lie Met Glu 
85 90 95 

Lys Tyr Lys Pro Phe Gin Leu Asp Thr Pro Leu Phe Val Trp Asn Ser 
100 105 110 

Phe Leu Ala lie Phe Ser He Leu Gly Phe Leu Arg Met Thr Pro Glu 
115 120 125 

Phe Val Trp Ser Trp Ser Ala Glu Gly Asn Ser Phe Lys Tyr Ser He 
130 135 140 

Cys His Ser Ser Tyr Ala Gin Gly Val Thr Gly Phe Trp Thr Glu Gin 
145 150 155 160 

Phe Ala Met Ser Lys Leu Phe Glu Leu lie Asp Thr He Phe He Val 
165 170 175 

Leu Arg Lys Arg Pro Leu He Phe Leu His Trp Tyr His His Val Thr 
180 185 190 

Val Met He Tyr Thr Trp His Ala Tyr Lys Asp His Thr Ala Ser Gly 
195 200 205 

Arg Trp Phe He Trp Met Asn Tyr Gly Val His Ala Leu Met Tyr Ser 
210 215 220 

Tyr Tyr Ala Leu Arg Ser Leu Lys Phe Arg Leu Pro Lys Gin Met Ala 
225 230 235 240 

Met Val Val Thr Thr Leu Gin Leu Ala Gin Met Val Met Gly Val He 
245 250 255 

He Gly Val Thr Val Tyr Arg He Lys Ser Ser Gly Glu Tyr Cys Gin 
260 265 270 

Gin Thr Trp Asp Asn Leu Gly Leu Cys Phe Gly Val Tyr Phe Thr Tyr 
275 280 285 



Phe Leu Leu Phe Ala Asn Phe Phe Tyr His Ala Tyr Val Lys Lys Asn 
290 295 300 



Asn Arg Thr Val Asn Tyr Glu Asn Asn Ser Lys Asn Phe Pro Asp Leu 
305 310 315 320 



Val Leu lie Tyr Leu Arg Lys Lys Val Ser Arg Lys Ser Lys Asn Arg 
325 330 335 

Gin Cys Ser Glu Asn Asn Tyr Lys lie Gin Phe Ser Ser Asn Phe Val 
340 345 350 

Asn Val Asp Gly Lys Lys His Lys Lys Thr Tyr Glu Leu lie Leu Pro 
355 360 365 

Arg Arg Lys Met Thr Thr lie Leu Thr Phe Leu Phe Gly Lys Asn Arg 
370 375 380 

lie Phe Ser Lys Tyr Gin Lys Asn Arg Lys Asn lie Ser lie Pro Val 
385 390 395 400 

Asp Phe Glu lie Leu Glu Pro Lys Glu Asp He Asn Ala Asn He Ala 
405 410 415 

Glu Pro Ser He Thr Thr Arg Ser Ala Ala Ala Arg Arg Lys Val Gin 
420 425 430 

Lys Ala Asp 
435 



<210> 17 
<211> 274 
<212> PRT 
<213> C. elegans 

<400> 17 

Met Ala Ala Ala Gin Thr Ser Pro Ala Ala Thr Leu Val Asp Val Leu 
1 5 10 15 

Thr Lys Pro Trp Ser Leu Asp Gin Thr Asp Ser Tyr Met Ser Thr Phe 
20 25 30 

Val Pro Leu Ser Tyr Lys He Met He Gly Tyr Leu Val Thr He Tyr 
35 40 45 

Phe Gly Gin Lys Leu Met Ala His Arg Lys Pro Phe Asp Leu Gin Asn 
50 55 60 

Thr Leu Ala Leu Trp Asn Phe Gly Phe Ser Leu Phe Ser Gly He Ala 
65 70 75 80 

Ala Tyr Lys Leu He Pro Glu Leu Phe Gly Val Phe Met Lys Asp Gly 
85 90 95 

Phe Val Ala Ser Tyr Cys Gin Asn Glu Asn Tyr Tyr Thr Asp Ala Ser 
100 105 110 
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Thr 



Gly Phe Trp Gly Trp Ala Phe Val Met Ser Lys Ala Pro Glu Leu 
115 120 125 



Gly Asp Thr Met Phe Leu Val Leu Arg Lys Lys Pro Val lie Phe Met 
130 135 140 

His Trp Tyr His His Ala Leu Thr Phe Val Tyr Ala Val Val Thr Tyr 
145 150 155 160 

Ser Glu His Gin Ala Trp Ala Arg Trp Ser Leu Ala Leu Asn Leu Ala 
165 170 175 

Val His Thr Val Met Tyr Phe Tyr Phe Ala Val Arg Ala Leu Asn lie 
180 185 190 

Gin Thr Pro Arg Pro Val Ala Lys Phe He Thr Thr He Gin He Val 
195 200 205 

Gin Phe Val He Ser Cys Tyr He Phe Gly His Leu Val Phe He Lys 
210 215 220 

Ser Ala Asp Ser Val Pro Gly Cys Ala Val Ser Trp Asn Val Leu Ser 
225 230 235 240 

He Gly Gly Leu Met Tyr He Ser Tyr Leu Phe Leu Phe Ala Lys Phe 
245 250 255 

Phe Tyr Lys Ala Tyr He Gin Lys Arg Ser Pro Thr Lys Thr Ser Lys 
260 265 270 

Gin Glu 



<210> 18 
<211> 286 
<212> PRT 
<213> C. elegans 

<400> 18 

Met Ser Ser Asp Asp Arg Gly Thr Arg Thr Phe Lys Met Met Asp Gin 
1 5 10 15 

lie Leu Gly Thr Asn Phe Thr Tyr Glu Gly Ala Lys Glu Val Ala Arg 
20 25 30 

Gly Leu Glu Gly Phe Ser Ala Lys Leu Ala Val Gly Tyr He Ala Thr 
35 40 45 

He Phe Gly Leu Lys Tyr Tyr Met Lys Asp Arg Lys Ala Phe Asp Leu 
50 55 60 

Ser Thr Pro Leu Asn He Trp Asn Gly He Leu Ser Thr Phe Ser Leu 
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65 



70 



75 



80 



m 



Leu Gly Phe Leu Phe Thr Phe Pro Thr Leu Leu Ser Val lie Arg Lys 
85 90 95 

Asp Gly Phe Ser His Thr Tyr Ser His Val Ser Glu Leu Tyr Thr Asp 
100 105 110 

Ser Thr Ser Gly Tyr Trp lie Phe Leu Trp Val lie Ser Lys lie Pro 
115 120 125 

Glu Leu Leu Asp Thr Val Phe lie Val Leu Arg Lys Arg Pro Leu lie 
130 135 140 

Phe Met His Trp Tyr His His Ala Leu Thr Gly Tyr Tyr Ala Leu Val 
145 150 155 160 

Cys Tyr His Glu Asp Ala Val His Met Val Trp Val Val Trp Met Asn 
165 170 175 

Tyr lie lie His Ala Phe Met Tyr Gly Tyr Tyr Leu Leu Lys Ser Leu 
180 185 190 

Lys Val Pro lie Pro Pro Ser Val Ala Gin Ala lie Thr Thr Ser Gin 
195 200 205 

Met Val Gin Phe Ala Val Ala lie Phe Ala Gin Val His Val Ser Tyr 
210 215 220 

Lys His Tyr Val Glu Gly Val Glu Gly Leu Ala Tyr Ser Phe Arg Gly 
225 230 235 240 

Thr Ala He Gly Phe Phe Met Leu Thr Thr Tyr Phe Tyr Leu Trp He 
245 250 255 

Gin Phe Tyr Lys Glu His Tyr Leu Lys Asn Gly Gly Lys Lys Tyr Asn 
260 265 270 

Leu Ala Lys Asp Gin Ala Lys Thr Gin Thr Lys Lys Ala Asn 
275 280 285 



<210> 19 
<211> 274 
<212> PRT 
<213> C. elegans 

<400> 19 

Met Pro Gin Gly Glu Val Ser Phe Phe Glu Val Leu Thr Thr Ala Pro 
1 5 10 15 

Phe Ser His Glu Leu Ser Lys Lys His He Ala Gin Thr Gin Tyr Ala 
20 25 30 
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Ala Phe Trp He Ser Met Ala Tyr Val Val Val He Phe Gly Leu Lys 
35 40 45 



Ala Val Met Thr Asn Arg Lys Pro Phe Asp Leu Thr Gly Pro Leu Asn 
50 55 60 

Leu Trp Asn Ala Gly Leu Ala He Phe Ser Thr Leu Gly Ser Leu Ala 
65 70 75 80 

Thr Thr Phe Gly Leu Leu His Glu Phe Phe Ser Arg Gly Phe Phe Glu 
85 90 95 

Ser Tyr He His He Gly Asp Phe Tyr Asn Gly Leu Ser Gly Met Phe 
100 105 110 

Thr Trp Leu Phe Val Leu Ser Lys Val Ala Glu Phe Gly Asp Tlir Leu 
115 120 125 

Phe He He Leu Arg Lys Lys Pro Leu Met Phe Leu His Trp Tyr His 
130 135 140 

His Val Leu Thr Met Asn Tyr Ala Phe Met Ser Phe Glu Ala Asn Leu 
145 150 155 160 



W Gly Phe Asn Thr Trp He Thr Trp Met Asn Phe Ser Val His Ser He 

P 165 170 175 

£ Met Tyr Gly Tyr Tyr Met Leu Arg Ser Phe Gly Val Lys Val Pro Ala 

ill 180 185 190 

Trp He Ala Lys Asn He Thr Thr Met Gin He Leu Gin Phe Val He 
195 200 205 

Thr His Phe He Leu Phe His Val Gly Tyr Leu Ala Val Thr Gly Gin 
210 215 220 

Ser Val Asp Ser Thr Pro Gly Tyr Tyr Trp Phe Cys Leu Leu Met Glu 
225 230 235 240 

He Ser Tyr Val Val Leu Phe Gly Asn Phe Tyr Tyr Gin Ser Tyr He 
245 250 255 

Lys Gly Gly Gly Lys Lys Phe Asn Ala Glu Lys Lys Thr Glu Lys Lys 
260 265 270 

lie Glu 



<210> 20 
<211> 281 
<212> PRT 
<213> C, elegans 
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<400> 20 

Met Tyr Leu Asn Tyr Phe Ala Thr Glu lie Phe His Arg Ser Ala Val 
15 10 15 

Cys Glu Thr Glu Ala Cys Arg Ser Ser Lys lie Met He Ala Asp Val 
20 25 30 

Phe Lys Trp Lys Phe Asp Ala Asn Glu Leu Trp Ser Leu Leu Thr Asn 
35 40 45 

Gin Asp Glu Val Phe Pro His He Arg Ala Arg Arg Phe He Gin Glu 
50 55 60 

His Phe Gly Leu Phe Val Gin Met Ala He Ala Tyr Val He Leu Val 
65 70 75 80 

Phe Ser He Lys Arg Phe Met Arg Asp Arg Glu Pro Phe Gin Leu Thr 
85 90 95 

Thr Ala Leu Arg Leu Trp Asn Phe Phe Leu Ser Val Phe Ser He Tyr 
100 105 110 

W Gly Ser Trp Thr Met Phe Pro Phe Met Val Gin Gin He Arg Leu Tyr 

115 120 125 

LlI 

ffi Gly Leu Tyr Gly Cys Gly Cys Glu Ala Leu Ser Asn Leu Pro Ser Gin 

j¥; 130 135 140 



Ala Glu Tyr Trp Leu Phe Leu Thr He Leu Ser Lys Ala Val Glu Phe 
145 150 155 160 



Val Asp Thr Phe Phe Leu Val Leu Arg Lys Lys Pro Leu He Phe Leu 
165 170 175 

1p His Trp Tyr His His Met Ala Thr Phe Val Phe Phe Cys Ser Asn Tyr 

□ 180 185 190 

- ' 

Pro Thr Pro Ser Ser Gin Ser Arg Val Gly Val He Val Asn Leu Phe 
195 200 205 

Val His Ala Phe Met Tyr Pro Tyr Tyr Phe Thr Arg Ser Met Asn He 
210 215 220 

Lys Val Pro Ala Lys He Ser Met Ala Val Thr Val Leu Gin Leu Thr 
225 230 235 240 

Gin Phe Met Cys Phe He Tyr Gly Cys Thr Leu Met Tyr Tyr Ser Leu 
245 250 255 

Ala Thr Asn Gin Ala Arg Tyr Pro Ser Asn Thr Pro Ala Thr Leu Gin 
260 265 270 

Cys Leu Ser Tyr Thr Leu His Leu Leu 
275 280 
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<210> 21 

<211> 288 

<212> PRT 

<213> C. elegans 

<400> 21 

Met Ala Gin His Pro Leu Val Gin Arg Leu Leu Asp Val Lys Phe Asp 
1 5 10 15 

Thr Lys Arg Phe Val Ala lie Ala Thr His Gly Pro Lys Asn Phe Pro 
20 25 30 

Asp Ala Glu Gly Arg Lys Phe Phe Ala Asp His Phe Asp Val Thr lie 
35 40 45 

Gin Ala Ser He Leu Tyr Met Val Val Val Phe Gly Thr Lys Trp Phe 
50 55 60 

Met Arg Asn Arg Gin Pro Phe Gin Leu Thr He Pro Leu Asn He Trp 
65 70 75 80 

Asn Phe lie Leu Ala Ala Phe Ser He Ala Gly Ala Val Lys Met Thr 
85 90 95 

Pro Glu Phe Phe Gly Thr He Ala Asn Lys Gly He Val Ala Ser Tyr 
100 105 110 

Cys Lys Val Phe Asp Phe Thr Lys Gly Glu Asn Gly Tyr Trp Val Trp 
115 120 125 

Leu Phe Met Ala Ser Lys Leu Phe Glu Leu Val Asp Thr He Phe Leu 
130 135 140 

Val Leu Arg Lys Arg Pro Leu Met Phe Leu His Trp Tyr His His He 
145 150 155 160 

Leu Thr Met He Tyr Ala Trp Tyr Ser His Pro Leu Thr Pro Gly Phe 
165 170 175 

Asn Arg Tyr Gly He Tyr Leu Asn Phe Val Val His Ala Phe Met Tyr 
180 185 190 

Ser Tyr Tyr Phe Leu Arg Ser Met Lys He Arg Val Pro Gly Phe He 
195 200 205 

Ala Gin Ala He Thr Ser Leu Gin He Val Gin Phe He He Ser Cys 
210 215 220 

Ala Val Leu Ala His Leu Gly Tyr Leu Met His Phe Thr Asn Ala Asn 
225 230 235 240 

Cys Asp Phe Glu Pro Ser Val Phe Lys Leu Ala Val Phe Met Asp Thr 
245 250 255 
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Thr Tyr Leu Ala Leu Phe Val Asn 
260 

Arg Gly Gly Lys Asp Lys Tyr Lys 
275 280 



Phe Phe Leu Gin Ser Tyr Val Leu 
265 270 

Ala Val Pro Lys Lys Lys Asn Asn 
285 



<210> 22 
<211> 269 
<212> PRT 
<213> C. elegans 

<400> 22 

Met Ser Ala Glu Val Ser Glu Arg Phe Lys Val Trp Thr Gly Asn Asn 
1 5 10 15 

Glu Thr lie He Tyr Ser Pro Phe Glu Tyr Asp Ser Thr Leu Leu He 
20 25 30 

Glu Ser Cys Arg Cys Thr Tyr Gin Leu Leu lie Leu Leu Arg Gin He 
35 40 45 

Tyr Tyr Arg Asp He Trp Ser His Gly Asn Leu Lys Ala Cys Asp Xaa 
50 55 60 

Leu Leu Leu Ala Trp Asn Gly Phe Leu Ala Val Phe Ser He Met Gly 
65 70 75 80 

Thr Trp Arg Phe Gly He Glu Phe Tyr Asp Ala Val Phe Arg Xaa Gly 
85 90 95 

Phe He Xaa Ser He Cys Leu Ala Val Asn Pro Arg Ser Pro Ser Ala 
100 105 110 

Phe Trp Ala Cys Met Phe Ala Leu Ser Lys He Ala Glu Phe Gly Asp 
115 120 125 

Thr Met Phe Leu Val Leu Arg Lys Arg Pro Val He Phe Leu His Trp 
130 135 140 

Tyr His His Ala Val Val Leu He Leu Ser Trp His Ala Ala He Glu 
145 150 155 160 

Leu Thr Ala Pro Gly Arg Trp Phe He Phe Met Asn Tyr Leu Val His 
165 170 175 

Ser He Met Tyr Thr Tyr Tyr Ala He Thr Ser He Gly Tyr Arg Xaa 
ISO 185 190 

Pro Lys He Val Ser Met Thr Val Thr Phe Leu Gin Thr Leu Gin Met 
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195 



200 



205 



Leu lie Gly Val Ser lie Ser Cys 
210 215 

Gly Glu Met Cys Gin Gin Ser Tyr 
225 230 

lie Tyr Ala Ser Phe Leu Val Leu 
245 

Leu Val Lys Lys Asp Lys Lys Pro 
260 



lie Val Leu Tyr Leu Lys Leu Asn 
220 

Asp Asn Leu Ala Leu Ser Phe Gly 
235 240 

Ser Ser Phe Phe Asn Asn Ala Tyr 
250 255 

Asp Val Lys Lys Asp 
265 
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